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HSAN il RELATED GENE AND EXPRESSION 
PRODUCTS AND USES THEREOF 

5 

This application claims priority of U.S. Provisional Applications 
60/502,453, filed 12 September 2003, and 60/425,601, filed 12 November 
2002, the disclosures of which are hereby incorporated by reference in their 
entirety. 

10 

FIELD OF THE INVENTION 

1 5 The present invention relates to methods of utilizing a gene (HSN2) 

involved in Hereditary Sensory and Autonomic Neuropathy, type II (HSAN 11) 
and expression products of this gene for the screening and identification of 
agents, such as small organic compounds, useful in the treatment of pain, 
neuropathy and related disorders in human patients as well as uses thereof to 

20 treat or otherwise ameliorate such disorders. 



25 BACKGROUND OF THE INVENTION 

The present invention relates to the medical disorder called Hereditary 
Sensory and Autonomic Neuropathy - Type II ("HSAN ir')» which is a member 

30 of a group of hereditary pain disordei^ known as the Hereditary Sensory and 
Autonomic Neuropathies (HSAN). HSAN comprises a group of five clinically 
and genetically heterogeneous disorders which are mainly characterized by 
variable sensory and autonomic dysfunction, including absence of pain. The 
pathology of HSAN is characterized by degeneration of peripheral sensory 

35 neurons and it appears to be hereditary, making it of considerable scientific 
interest to determine which gene or genes underiies the disorder. 
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HSAN II is an autosomal recessive condition with distal generalized 
sensory loss. (See Online Mendellan Inheritance in Man (OMIM) reference 
*201300) Patients with HSAN II typically present with "glove and sock" 
distribution, meaning that they have reduced or complete loss of pain, 

5 temperature and touch sens&tions In the lower legs and feet as well as hands 
and foreamis. Age of onset is quite early, usually In the first decade of life. 
They first complain of a numbness In their extremities which is aggravated by 
the cold, after which pain sensation Is reduced. Patients have difficulty with 
handling small objects (like coins) In their pockets due to the lack of 

0 sensation. Some families appear to have a slowly progressive form, where 
the lack of sensation slowly grows up the legs and arms until in some cases, 
the patients have reduced sensations in their trunkal regions while others do 
not Reduced sensation is confined just to the lower legs (usually below the 
knees), forearms and hands. Muscle atrophy (particularly in calves, and in 

5 between the thumb and first finger), ulcerations and infections are major 
secondary features. Amputations of fingers, toes and sometimes feet and 
lower limbs, resulting from infections, are not uncommon. Reflexes are 
diminished or lost. There is very little, if any, autonomic involvement, so 
sweating and blood pressure are normal. 

20 

The pathology of HSAN 11 appears to be tied to peripheral nerve 
degeneration. Biopsy reveals that within the sural nerve there Is a severe loss 
of myelinated axons, and a lesser loss of nonmyelinated fibres. No cutaneous 
sensory receptors or nerve fibers are seen. (Axelrod, F.B. "Autonomic and 
25 sensory disorders". Chapter 117; pages 3146-3161 in Emerv and Rimoln's 
Principles and Practice of Medical Genetics (Fourth Edition) Edited by: David 
L. Rimoin, J. Michael Connor, Reed E. Pyeritz, Bnjce R. Korf. Published: 
Harcourt Publishers Limited London 2002). 

30 The genetic basis of HSAN 11 Is yet to be determined. No genetic loci 

have been confirmed, and no mutations in suggested candidate genes have 
been identified. 
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The HSAN II gene (dubbed herein "HSN2") would present a novel 
target for therapeutic agents, The functional validation disclosed herein shows 
that the activity of the gene/protein is involved in very specific human disease 
processes. Therapeutic agents which modulate the biological activity of the 
5 HSAN II gene or its conresponding protein are novel therapeutic agents for the 
treatment of pain, neuropathy and related disorders. 

The screening assays disclosed herein select from a large library of 
compounds those compounds that interact with, bind to or otherwise modulate 
1 0 the activity of the target gene/protein. A wide body of commercial literature 
describes the screening of chemical libraries of diverse potential therapeutic 
agents to identify such potential therapeutic agents. 

Therapeutic agents for treatment of pain fall into two main classes - the 
15 NSAIDs (non-steroidal anti-inflammatory drugs) and the opioids. NSAIDs 
treat pain in a way similar to the mechanism of aspirin, the most well-known 
and oldest member of the class. Common NSAIDs include acetaminophen, 
ibuprofen and naproxen. These drugs mainly inhibit the body's ability to 
synthesize prostaglandins. The common mechanism of action for all NSAIDs 
20 is the inhibition of the enzyme cyclooxgenase (COX). A major commercial 
success has been achieved with specific inhibitors of COX-2, such as 
Celebrex™ from Pharmacia/Pfizer, and Vioxx™ from Merck & Co. 

Opioids act through the opioid receptor family. These drugs include 
25 the weak opioids such as codeine and Tylenol 3, and strong opioids such as 
morphine and methadone. Some are long acting, others are of short duration. 
Opioid analgesics have a tendency to addiction and. dependency, and so are 
not preferred for long-term or chronic pain management 

30 Outside of the NSAIDs and opioids, there are a number of other 

suggested analgesic agents in clinical trials (I.e. not yet approved for 
marketing) which are believed to have alternative targets. Some clinical trials 
are attempting to establish that central neuropathic pain may respond to ion 
channel blockers such as blockers pf calcium, sodium and/or NMDA (N- 
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methyl-D-aspartate) channels. The literature provides substantial pre-dinical 
electrophysiological evidence in support of the use of NMDA antagonists in 
the treatment of neuropathic pain. Such agents also may find use in the 
control of pain after tolerance to opioid analgesia occurs, particularly in cancer 
5 patients. 

Current treatment of neuropathies first entails identifying the cause, 
then treating the cause if possible. There is a substantial list of disorders that 
can result in neuropathies, Including diabetes arid other diseases. Most 

10 neuropathies (except the genetic forms) are treatable by treating the root 
causes, though most treatments are only partially effective. For example 
vitamin deficiency is treated with vitamins; Guillain-Barre and CIDP are 
treated with gamma globulins and plasmapheresis. For genetic forms of 
neuropathy, current clinical practice is to manage patients by supporting their 

1 5 efforts to avoid self-inflicted injury. 

In accordance with the present invention, the identification of the 
hereditary basis for HSAN li facilitates developing more potent agents for 
treating pain, neuropathy, and related disorders. The underlying genetic 
20 mutation provides a therapeutic target for noyel therapeutic agents. This 
therapeutic target pemiits identification and discovery of more effective 
analgesics and other therapeutic agents, as well as new methods and 
compositions for diagnosis of HSAN II distinguish between types of inherited 
pain disorders. 

25 
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BRIEF SUMMARY OF THE INVENTION 

In one aspectr the invention relates to the nucleic acid sequence for 
HSN2, including the genomic sequence, mRNA or cDNA. polymorphic, allelic, 
5 isoforms (adult, neo-natal, etc.) and mutant forms thereof, and nucleic acid 
constructs of the gene, including vectors, plasmids and recombinant cells and 
transgenic organisms containing or corresponding to HSN2 (or knock-outs 
thereof). 

10 In another aspect, the invention relates to the gene product of HSN2 

(sensorin). including the polypeptide, protein, and amino acid sequence, and 
the polymorphic, allelic, isoforms (adult, neo-natal, etc.) and mutant forms 
thereof, and recombinant cells and transgenic organisms wherein this 
polypeptide or a polypeptide corresponding thereto is expressed. 

15 

In another aspect of the present invention, the HSN2 gene or protein is 
incorporated into a screening assay whereby test compounds (potential 
therapeutic agents) are evaluated, for their ability to modulate HSN2 gene 
expression or sensorin activity, thereby identifying modulators of the gene or 
20 protein and thus potential therapeutic agents. 

In preferred embodiments, the polynucleotide whose expression is to 
be measured or monitored is present in an intact cell, preferably a mammalian 
cell, most preferably a peripheral neuron, and may include a recombinant cell. 
25 Such polynucleotide may also be present outside of a cell and the expression 
may be measured in vitro. 

In another aspect, the present invention relates to a method for 
identifying an agent that modulates the activity of a polypeptide encoded by a 
polynucleotide as disclosed herein. 

30 

In a further aspect, the present invention relates to a method for 
identifying an agent for the treatment of pain, neuropathy or a related disorder 
in an animal. Preferably, the animal is a mammal, such as a human being. In 
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specific embodiments, the pain stimulus is a lieat stimulus and reaction or 
sensitivity to hot and/or cold may be measured. In another embodiment, an 
electrical stimulus may be used. 

5 In a preferred embodiment of the invention; the compound identified 

which modulates HSN2 (or sensorin) is selective for the target ahead of 
related genes or proteins. 

In a further aspect, the present invention relates to a method for 
treating a condition in an animal afflicted with chronic pain, neuropathy or a 

10 related disorder comprising administering to said animal an effective amount 
of an agent first identified by an assay method of the invention. Preferably, 
said animal is a human patient. Such determined agents may also be applied 
to alternative or additional indications beyond pain, neuropathy or other 
disorders, which are found to be treatable by modulating HSN2 or sensorin 

1 5 activity. 

In a further aspect, the present invention relates to compounds that 
modulate HSN2 or sensorin. These compounds include antibodies, antisense 
compounds, gene therapy vectors and proteins, and small molecule organic 
compounds. The invention also comprises the use of these compounds in the 
20 treatment of pain, neuropathy and related disorders. 



BRIEF DESCRIPTION OF THE DRAWINGS 

25 . 

Figure 1a shows the pedigree for a family designated HSAN4 showing 
family relationships and haplotypes. Figure 1b shows the pedigree for a 
family designated HSAN3 showing family relationships and haplotypes. 
Squares and circles represent males and females, respectively. Filled 
30 symbols indicate individuals with HSAN II. Symbols with an "N" indicate 
individuals diagnosed as normal, and clear symbols and symbols with a 
question mark indicate individuals who have not been diagnosed 



wo 2004/043999 




:T/CA2003/001716 



Figure 2. Domain prediction of the HSN2 , protein, sensorin. TM 
topology was predicted using TMPred. 

Figure 3. Peptide sequence alignment of human, mouse, rat, fugu and 
5 tetraodon sensorin proteins. 

Figure 4. Peptide sequence alignment of human and Fugu sensorin. 

Figure 5. Peptide sequence alignment of Fugu and Zebrafish orthologs 
10 of sensorin. 

Figures 6A-6E shows the sequences of HSN2 from various sources. In 
all cases, cDNA sequences are shown in capital letters. Start and stop codons 
are underilned. 

15 

Figure 6A shows a sequence (SEQ ID NO: 1) for wild-type human 
HSN2 putative cDNA (with ends based on human ESTs). 

Figures 6B-1 and 6B-2 shows the promoter sequence (SEQ ID NO: 6) 
20 for human HSN2 (presumably there only 1 promoter used and it is not 
associated with a CpG-rich island; this corresponds to the segment of 
PRKWNK1 intron 8 upstream of the HSN2 start codon; the transcription start 
site and cDNA sequence (shown in capital letters) for HSN2 is tentative and 
based on 1 EST sequence; the translation start site, and start ATG, is the last 
25 3 bases. . . 

Figure 6C shows the nucleotide sequence (SEQ ID NO: 7) of wild-type 
mouse HSN2 cDNA predicted (5' and 3' ends not detemiined). The putative 
polypeptide is SEQ ID NO: 8. 

30 

Figure 6D shows the nucleotide sequence (SEQ ID NO: 9) for wild-type 
rat HSN2 (with the genomic sequence defined by 5' and 3' ends in EST 
BF522762/AI5781 84, and containing a polyA tail). 
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Figure 6E-1 to 6E-6 show the genomic structure and sequence of the 
human HSN2. (SEQ ID NO. 12), with the figures following in sequence. The 
HSN2 gene Is a single unspliced exon found in intron 8 of the PRKWNK1 
gene (also called XH03, below). Here, italics = interspersed repeats. 
5 uppercase = exons, bold = exons (or part of exons) found in common 
isoforms and underlined = start and stop codons, or upstream in-frame stop 
codon. 

Figures 7A-7C show the sequences of cDNAs for three separate 
1 0 mutations identified in individuals afflicted with HSAN II. Start and stop codons 
are underlined. Figure 7A shows mutation 1 (SEQ ID NO: 14) as human 
cDNA with mutation c.594delA. Figure 7B shows mutation 2 (SEQ ID NO: 15) 
as human cDNA with mutation c.918insA. Figure 7C shows mutation 3 (SEQ 
ID NO: 16) as human cDNA with mutation C.9430T. 

15 

Figure 8 shows three in-frame candidate translation initiation codons. 

Figure 9 shows a sequence alignment between the human and pig 
peptide sequences. 



DEFINITIONS 

25 The following terms have their indicated meaning unless expressly stated 

otherwise herein. 

As used herein, the term "percent identity" or "percent identical " when 
refening to a sequence, means that a sequence is compared to a claimed or 
30 described sequence after alignment of the sequence to be compared (the 
"Compared Sequence") with the described or claimed sequence (the 
"Reference Sequence"). The Percent Identity is then determined according to 
the following formula: 



wo 2004/043999 




:T/CA2003/001716 



Percent Identity = 100[1-(C/R)1 

wherein C is the number of differences between the Reference Sequence and 
the Compared Sequence over the length of alignment between the Reference 
5 Sequence and the Compared Sequence wherein (i) each base or amino acid in 
the Reference Sequence that does not have a corresponding aligned base or 
amino acid in the Compared Sequence and (ii) each gap in the Reference 
Sequence and (iii) each aligned base or amino acid in the Reference Sequence 
that is different from an aligned base or amino acid in the Compared Sequence, 
1 0 constitutes a difference; and R is the number of bases or amino acids in the 
Reference Sequence over the length of the alignment with the Compared 
Sequence with any gap created in the Reference Sequence also being counted 
as a base or amino acid. 

15 If an alignment exists between the Compared Sequence and the 

Reference Sequence for which the percent identity as calculated above is about 
equal to or greater than a specified minimum Percent Identity then the 
Compared Sequence has the specified minimum percent identity to the 
Reference Sequence even though alignments may . exist in which the 

20 hereinabove calculated Percent Identity is less than the specified Percent 
Identity. 

As used herein, the temns "portion," . "segment," and '•fragment," when 
used in relation to polypeptides, refer to a continuous sequence of nucleotide 
25 residues, sequence forms a subset of a larger sequence. Such terms include 
the products produced by treatment of said polynucleotides with any of the 
common endonucleases, or any stretch of polynucleotides that could be 
synthetically synthesized. These may include exonic and intronic sequences of 
the corresponding genes. 

30 

As used herein the temi "encode" and its derivatives mean "can be 
transcribed into" or "can be translated into" so that a gene or other deoxy 
polynucleotide sequence can be described in terms of its ability to encode an 
RNA or polypeptide while the latter terms relate to materials that can be 
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described in terms of their being encoded by said gene or deoxy 
polynucleotide. In the sanne way. an RNA can be said to encode a polypeptide 
while said polypeptide can be described as being encoded by said RNA. 

5 The words "therapeutic target^' mean a gene, an RNA or polypeptide for 

which therapeutic intervention is achieved with agents that modulate the 
activity of said gene, RNA or protein. ''Modulate" means to increase, to 
decrease, or to otherwise change the expression or activity, especially of a 
therapeutic target. 

10 

"HSN2 activity" or "sensorin activity" as used herein, especially relating 
to screening assays, is to be interpreted broadly and contemplates ail directly 
or indirectly measurable and identifiable biological activities of the HSN2 
gene, gene products and sensorin. The HSN2 gene is thus involved with 
1 5 functions related to Hereditary Sensory arxd Autonomic Neuropathy - Type II 
("HSAN ir) and is, in the past, has also been called the "HSAN II 
Susceptibility Gene." 

The term "polynucleotide" is used interchangeably with "gene", "cDNA", 
20 "mRNA", "oligonucleotide", and "nucleic acid". 

The temis "polypeptide," "peptide," and "protein" are used 
interchangeably herein to refer to a polymer of amino acid residues. The 
terms apply to amino acid polymers in which on^ or more amino acid residue 
25 is an analog or mimetic of a corresponding naturally occurring amino acid, as 
well as to naturally occurring amino acid polymers. 

The temn "agent" is used interchangeably with the term "compound" 
and likewise the term "test agent" is used interchangeably with the term "test 
30 compound". 

The term "operably linked" refers to a functional linkage between a 
nucleic acid expression control sequence (such as a promoter, or array of 
transcription factor binding sites) and a second nucleic acid sequence. 
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wherein the expression control sequence directs transcription of the nucleic 
acid corresponding to the second sequence. 

The tenn "mammar refers to a member of the zoological class 
5 Mammalia. Examples of mammals Include, without limitation, humans, 
primates, chimpanzees, mice, rats, rabbits, sheep, and cows. 

The tenn "treat" or "treatment" encompasses therapeutic treatment, 
preventive treatment and protective treatment. Such protective treatment 
1 0 includes treatment of patients, especially to achieve analgesia. 

As used herein, the term "correspond" applied to a gene or 
polynucleotide, such as a DNA, means that the gene has the indicated 
nucleotide sequence or that it encodes substantially the same RNA as would 
1 5 be encoded by the indicated sequence, the term "substantially" meaning at 
least 95% identical as defined elsewhere herein. 

The term "DNA segment" refers to a DNA polymer, in the form of a 
separate fragment or as a component of a larger DNA construct, which has 

20 been derived from DNA isolated at least once in substantially pure form, i.e., 
free of contaminating endogenous materials and in a quantity or concentration 
enabling identification, manipulation, and recovery of the segment and its 
component nucleotide sequences, by standard, biochemical methods, for 
example, using a cloning vector. Such segments are provided in the form of 

25 an open reading frame uninterrupted by internal nontranslated sequences, or 
introns, which are typically present in eukaryotic genes. Sequences of non- 
translated DNA may be present downstream firom th^ open reading frame, 
where the same do not interfere with manipulation or expression of the coding 
regions. 

30 

"Isolated" in the context of the present invention with respect to 
polypeptides (or polynucleotides) means that the material is removed from its 
original environment {e.g., the natural environment if it is naturally occurring). 
For example, a naturally-occurring polynucleotide or polypeptide present in a 
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living organism is not isolated, but the same polynucleotide or polypeptide, 
separated from some or all of the co-existing materials in the natural system, is 
isolated. Such polynucleotides could be part of a vector and/or such 
polynucleotides or polypeptides could be part of a composition, and still be 
5 isolated in that such vector or composition is not part of its natural environment. 
The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 

The polynucleotides, and recombinant or immunogenic polypeptides, 
1 0 disclosed in accordance with the present invention may also be in "purified" 
form. The term "purified" does not require absolute purity; rather, it is 
intended as a relative definition, and can include preparations that are highly 
purified or preparations that are only partially purified, as those terms are 
understood by those of skill in the relevant art. For example, individual clones 
15 isolated from a cDNA library have been conventionally purified to 
electrophoretic homogeneity. Purification of starting material or natural 
material to at least one order of magnitude, preferably two or three orders, 
and more preferably four or five orders of magnitude is expressly 
contemplated. Furthermore, claimed polypeptide which has a purity of 
20 preferably 0.001%, or at least 0.01% or 0.1%; and even desirably 1% by 
weight or greater is expressly contemplated. 

The term "coding region" refers to that portion of a gene which either 
naturally or normally codes for the expression product of that gene in its 
25 natural genomic environment, i.e., the region coding in vivo for the native 
expression product of the gene. The coding region can be from a normal, 
mutated or altered g^ne, or can even be from a DNA sequence, or gene, 
wholly synthesized in the laboratory using methods well known to those of 
skill in the art of DNA synthesis. 

30 

In accordance with the present invention, the term "nucleotide 
sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, DNA 
segments encoding the proteins provided by this Invention are assembled 
from cDNA fragments and short oligonucleotide linkers, or from a series of 
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oligonucleotides, to provide a synthetic gene wiiich is capable of being 
expressed In a recombinant transcriptional unit comprising regulatory 
elements derived from a microbial or viral operon. 

5 The temn "expression product" means that polypeptide or protein that is 

the natural translation product of the gene and any nucleic add sequence 
coding equivalents resulting from genetic code degeneracy and thus coding 
for the same amino acid(s). 

10 The term "fragment," when referring to a coding sequence, means a 

portion of DNA comprising less than the complete coding region whose 
expression product retains essentially the same biological function or activity 
as the expression product of the complete coding region! 

1 5 The term "primer" means a short nucleic acid sequence that is paired 

with one strand of DNA and provides a free 3'OH end at which a DNA 
polymerase starts synthesis of a deoxyribonucleotide chain. 

The term "promoter" means a region of DNA involved in binding of 
20 RNA polymerase to initiate transcription. 

The term "open reading frame (ORF)" means a series of triplets coding 
for amino acids without any termination codons and is a sequence 
(potentially) translatable into protein. 

25 

As used herein, reference to a DNA sequence includes both single 
stranded and double stranded DNA. Thus, the specific sequence, unless the 
context indicates otherwise, refers to the single strand DNA of such 
sequence, the duplex of such sequence with its complement (double stranded 
30 DNA) and the complement of such sequence. 
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DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, the human HSN2 Gene, is 
disclosed. This gene, when mutated, results in Hereditary Sensory and 

5 Autonomic Neuropathy - Type II ("HSAN 11"). This gene is designated herein 
as HSN2 (but in the past has sometimes been referred to as SENS or XH18 
are other designations). The polypeptide is referred to as HSN2 Protein or 
HSN2 peptide. In the past, it has also been referred to as HSAN II 
polypeptide, HSN2 polypeptide, HSANII peptide. It is also refen-ed to as 

O sensorin, although this latter tenm has been applied to a minor unrelated 
protein in the past. This naming is not essential to use of the invention 
claimed herein. 

Thus, the present invention provides a gene and its corresponding 
5 protein related to insensitivity to pain, neuropathy and related disorders, 
referred to as HSN2 and its corresponding polypeptide sensorin or HSN2 
protein. 

The wild type, nucleic acid sequence of human HSN2 is provided at 
20 SEQ ID NO. 1. The wild type amino acid sequence of human HSN2 protein is 
set forth in SEQ ID NO: 2. . 

Three in-frame candidate translation Initiation codons have been 
identified (Fig. 8). The 3"^ ATG is closest to the optimal Kozak consensus C- 
25 C-[AG]-C-C-A-T-G-G context for initiation of translation, but is not conserved 
in the rat sequence. Those skilled in the art are capable of characterizing the 
most efficient translation initiation context by generating three HSN2 cDNA 
expression vectors, each containing different starting site combinations 
including one, two or three ATG codons. 

30 

The human HSN2 gene maps to the HSAN II candidate interval at 
12p13.33. The gene encodes a fulWength conserved peptide of 434 amino 
acids. It comprises a single unspliced exon which maps to intron 8 of the 
PRKWNK1 gene and is transcribed from the same strand as PRKWNK1. 
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HSN2-encodecl protein may be the full length PRKWNK1 protein with a 
special exon. Thus, the present invention can also encompass a full length 
PRKWNK1 protein (including cDNA) containing the HSN2 exon. 

5 In one aspect, the present invention relates to an isolated 

polynucleotide comprising a polynucleotide having a nucleotide sequence with 
at least 60% identity to, preferably at least 70% identity to, more preferably at 
least 78% Identity to, even more preferably at least 90% identity to, most 
preferably at least 95% identity to, and especially at least 98% identity to a 
10 sequence selected from the group consisting of SEQ ID NO: 1, 6, 7, 9 and 12 
wherein said isolated polynucleotide encodes a polypeptide that binds to an 
antibody specific for a polypeptide having the amino acid sequence of SEQ ID 
NO: 2. In an especially preferred embodiment, the isolated polynucleotide 
comprises the nucleotide sequence of SEQ ID NO: 1, 6, 7, 9 or 12. 

15 

In another aspect, the present invention relates to an isolated 
polypeptide comprising a polypeptide having an amino acid sequence with at 
least 90% identity to, preferably at least 95% identity to, more preferably at 
least 98% identity to and most preferably having the amino acid sequence 
20 selected from the group consisting of SEQ ID NO: 2, 8, 10, 1 1 and 13 wherein 
said polypeptide binds to an antibody specific for a polypeptide having the 
amino acid sequence of SEQ ID NO: 2, 

As disclosed herein, three separate mutations were identified in 
25 individuals affected with HSAN II, none of which are found in the homozygous 
state in unaffected Individuals (see Table 3, SEQ ID NO: 3 to 5). Each 
mutation affects the open reading frame (ORF) of the HSN2 gene. The cDNA 
sequences for these mutations are . shown in Figure 7 and may be 
summarized as follows: 

30 

Mutation 1. c.594delA; frameshift in codpn 198, protein truncated to 
206 aa: A deletion of an A (adenine) at position 594 in the nucleotide 
sequence (relative to the start ATG), found homozygous in the HSAN4-70 
sample, causes a frameshift in codon 198 leading to premature truncation of 
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a 206 aa peptide. The predicted amino acid sequence of the truncated human 
sensorin sequence from Mutation 1 is set forth at SEQ ID NO. 3. The same 
mutation was also later found to be responsible for the disease in the HSAN3 
family. This deletion segregated with disease in other members of the HSAN3 
5 and HSAN4 families. The mutation was not detected in any of 217 nomnal 
chromosomes from individuals of European descent. 

Two different mutations were found in the HSAN5-302 sample. 

10 Mutation 2. c.918insA, firameshifl in codon 307. protein truncated to 

318 aa. This mutation is a 1 bp insertion of an A between bases 918-919, 
causing a frameshift in codon 307, and leading to premature truncation to 
produce a 318 aa (amino acid) peptide. The predicted amino acid sequence 
of the truncated human sensorin sequence from Mutation 2 is set forth at SEQ 

1 5 ID NO. 4 

Mutation 3. c.943C>T; Gln315stop; protein tmncated to 314 aa. This 
mutation is a C>T change at position 943 that changes codon 315 (GAG, 
encoding glutamine) to a TAG stop codon, therefore prematurely truncating 
20 the protein at 314 aa. Each of these mutations was not found in the control 
sample, dbSNP database, nor available gianomic and EST sequences. The 
predicted amino acid sequence of the truncated human sensorin sequence 
from Mutation 3 is set forth at SEQ ID NO. 5. The same mutation was also 
later found to be responsible for the disease in the HSAN6 family. 

25 

Segregation of mutations In HSN2 with affected HSAN 11 patients has 
now been confimned in 5/5 affecteds and 0 control (unaffected) individuals. 

Promoter: A sequence containing the promoter sequence is set forth 
30 at SEQ ID NO. 6. It includes the portion of PRKWNK1 intron 8 upstream of 
the HSN2 translation initiation site. There are a number of DNA elements that 
are conserved between human and mouse in the region upstream of the 
presumed transcription initiation site that may act as promoter elements, 
including silencers and enhancers, and other transcription factor binding sites. 
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Other single nucleotide polymorpiiisms (SNPs) and variations: there is 
only 1 documented SNP (rs1054186) in the HSN2 gene, which corresponds 
to a C>T change at position 1690 of the cDNA, located in the 3'UTR. This 
5 putative SNP may not be real, as it is found in only one published expressed 
sequence tag (EST)and the quality of the sequencing is too poor to evaluate 
additional SNPs in this gene. 

BAG contig: The HSN2 gene maps to BAG (Bacterial Artificial 
10 Ghromosome) clone AG004766, and there are no gaps in the genomic 
structure. Based on the structure of the HSN2 gene detemiined above, the 
gene spans a maximum of 3 kb, consists of a single exon mapping within 
intron 08 of the XH03 gene (PRKWNK1), and is transcribed from the same 
strand as XH03, namely tel 5'-HSN2-3' cen in 12p13.33. There are probably 
15 no additional spliced exons, since these would interfere with splicing of XH03. 
The position of HSN2 within an intron of the PRKWNK1 gene Is conserved in 
human, mouse, rat and fish (form example, Fugu and Tetraodon). 

Markers: D12S91 is the closest marker, mapping ~29 kb telomeric to 

20 HSN2. 

Genomic sequence: the full human genomic sequence illustrating the 
nucleic acid sequence around HSN2 is set forth at SEQ ID NO. 12. 

25 

Gonservation of HSN2 in other organisms. 

Orthologs of the human HSN2 gene were found in mouse, rat and fish 
(Takifugu rubripes ("Fugu"), Danio rerio ("Zebrafish"), and Tetraodon 
30 nigroviridis ("Tetraodon").) 

A conserved mouse ORF, and flanking sequences were identified from 
a mouse BAG AC1 13092. The nucleic acid sequence of mouse HSN2 is set 
forth at SEQ ID NO. 7. The translated amino acid sequence of mouse HSN2 
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is set forth at SEQ ID NO. 8. No mouse ESTs of HSN2 were identified in a 
comprehensive public database search. 

Likewise the rat ortholog was identified from rat genomic sequences, 
5 and verified with the sequence of one rat EST (BF522762). This EST was 
used to identify the conserved rat ORF based on the genomic sequence 
found in BAG sequences AC1 06348 and AC1 06932. The nucleic acid 
sequence of wild-type rat HSN2 is set forth at SEQ ID NO. 9. The translated 
amino acid sequence of wild-type rat sensorin Is set forth at SEQ ID NO. 10. 

10 

Analysis of human, mouse and rat orthoiogs. The human protein , is 
87% identical to mouse, and 85% identical to rat The upstream stop codon 
immediately preceding the start ATG Is conserved in human, mouse and rat. 

1 5 Fugu (Takifugu rubripes): The human sensorin peptide sequence was 

used to screen the fugu genome by sequence similarity searching. One hit 
gave an E value score of 1e-25 against a Fugu genomic fragment 
(CAAB01 000768). The sequence was extracted and the conserved ORF is 
set out at SEQ ID NO. 11. 

20 

Zebrafish (Danio rerio): The Fugu peptide sequence was used to 
screen the non-human non-mouse EST database using tBLASTn to identify 
additional orthoiogs. An EST from a zebrafish adult retina cDNA library was 
found (BG30453g) which gave a partial protein sequence which is well- 
25 conserved to sensorin in Fugu. There were no additional zebrafish ESTs in 
GenBank. 



Tetraodon (Tetraodon nigrovirldis): The Fugu peptide sequence was 
used to screen the. publicly available Tetraodon genomic scaffolds 
30 (http://Fugu.hgmp.mrc.ac.uk/blast/) and a partial sequence of the Tetraodon 
sensorin ortholog was identified. (SEQ ID No. 13) 

Alignment of the human, mouse, rat, Fugu, Tetraodon and zebrafish 
sensorin amino acid sequences is shown in Fig 3. 
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Evaluation of Sensorin Function 

The membrane topology of the sensorin was' assessed using TMPred 
(located at \AAww.ch.embnet.org/softwareyTMPRED_form.html), which 

5 predicted a single N-terminal trans-membrane ™ domain, with the N-temiinus 
intracellular. TMPred was used to assess the membrane topology of the 
sesorin protein. The strongly preferred model predicted only 1 transmembrane 
domain (score 533, considered significant if >500) located between aa 4 and 
22, with the N-terminus locatied inside the cell. The result of this search is set 

1 0 out at Figure 2. 

SignalP, an established signal peptide prediction tool, indicated that the 
sensorin protein has a signal peptide that is cleaved after the first 17 amino 
acids. Sensorin may be compact and globular in shape. 

15 

Further aspects of sensorin function at the biochemical level may be 
determined by those skilled in the art using standard techniques. It is clear 
from the instant invention that sensorin is a survival factor for peripheral 
nerves, as the absence of this gene in humans leads to the neuropathic 
20 condition of HSAN II. 

Expression: Based on ESTs, the HSN2 gene is expressed in the 
sympathetic trunk, kidney, testis, fetal liver spleen, head-neck tumor, germ 
cell tumors, muscle, uterus, retina, whole eye, and CNS (multiple sclerosis 

25 lesions). SAGE expression was detected in pancreatic adenocarcinoma, 
brain astrocytoma, brain glioblastoma, colon adenocarcinoma, ependymoma 
and normal vascular endothelial cells. There is no GNF data. Therefore the 
HSN2 gene appears to be widely expressed in many tissues, but at extremely 
low levels. HSN2 may be expressed in peripheral sensory neurons or 

30 supporting Schwann cells, which usually make up only a small percentage of 
any given tissue's mass. 

It is recognized that those skilled in the art may prefer to use forms of 
HSN2 or sensorin corresponding to the sequences disclosed herein, although 
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not necessarily the same. For example, screening assays may utilize HSN2 or 
sensorin from a different organism, preferably a vertebrate, and most 
preferably from a mammalian species. The shared technical features of these 
fornis of HSN2 or sensorin, are that, when expressed, they have similar 

5 biological activity, and that they share functional similarity with HSN2 or 
sensorin, as the case may be, such as may be determined by those skilled in 
the art. Thus the invention encompasses the use of, Including but not limited 
to, sheep, dog, rat, mouse or horse HSN2 or sensorin, for the same purposes 
as set out more specifically herein for human HSN2 or sensorin. The HSN2 

0 gene and/or sensorin polypeptide according to the invention may also be 
obtained from other mammalian species, other vertebrates, Invertebrates and 
microorganisms based on the disclosure herein. 

Because of the processing that may take place in transfomning the 
5 initial RNA transcript into the final mRNA, the sequences disclosed herein 
may represent less than the full genomic sequence. They may also represent 
sequences derived from alternate splicing of exons. Consequently, the genes 
present in the cell (and representing the genoniic sequences) and the 
sequences disclosed herein, which are mostly cDNA sequences, may be 

20 identical or may be such that the cDNAs- contain less than the full genomic 
sequence. Such genes and cDNA sequences are still considered 
corresponding sequences because they both encode similar RNA sequences. 
Thus, by way of non-limiting example only, a gene that encodes an RNA 
transcript, which is then processed Into a shorter mRNA, Is deemed to encode 

25 both such RNAs and therefore encodes an RNA corresponding to an HSN2 
sequence as disclosed herein. Thus, the sequences disclosed herein 
correspond to genes contained in the cells and are used to detennlne relative 
levels of expression because they represent the same sequences or are 
complementary to RNAs encoded by these genes. Such genes also include 

30 different alleles and splice variants that may occur in the cells used in the 
processes of the invention. 
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Method of Treatment Using HSN2 as a Therapeutic Target 

The discovery- that mutations in HSN2 relate to clearly definable 
physiological outcomes in humans (namely HSAN II) establishes a clear 

5 function for this protein, and now allows the inventors to establish, for the first 
time, that HSN2 Gene and sensorin are useful as therapeutic targets in 
humans for the treatment of pain, neuropathy and related disorders. Standard 
industrial processes are available to those skilled in the art to confirm the 
identity of the therapeutic agents which modulate the activity of the gene or 

1 0 protein, many of which are set out below. 

In accordance with the foregoing, the present invention relates to a 
method for treating a disorder comprising administering to a person in need of 
such treatment an effective amount of a selective sensorin agonist or 
1 5 antagonist, or a pharmaceutically acceptable salt thereof, or a pharmaceutical 
composition containing either entity. 

In a preferred embodiment, said disorder is pain, neuropathy or a 
related disorder. In a further preferred embodiment, said administering is by 
20 oral or intravenous means. 

Identification of Therapeutic Agents. 

The present invention also readily affords different means for 
25 identification of agents for treating pain, neuropathy and related disorders 
according to their ability to modulate the activity of HSN2 or its protein. Such 
means involve testing libraries of chemical compounds, either one at a time or 
in combinations, In an assay format which Is designed to measure a biological 
activity related to HSN2 or sensorin, the protein encoded by HSN2. Those 
30 library compounds that modulate the biological activity in the desired fashion 
are thereby identified as therapeutic agents of the Invention. 

Exemplary assay methods useful for the identification of such 
compounds are detailed herein, although those skilled in the art will be aware 
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of alternative means. In a first step, compounds are sequentially tested 
against the assay to determine whether they influence a measurable 
biological activity of the assay. 

5 Assays may be based one or more of the diverse measurable 

biological activities of a gene or polypeptide corresponding to HSN2 or 
sensorin. Relating to the purified sensorin protein, sensorin activity includes, 
but is not limited to, all those biological processes, interactions, binding 
behavior, binding-activity relationships, pKa, pD, en2yme kinetics, stability, 

1 0 and functional assessments of the protein. Relating to sensorin activity in cell 
fractions, reconstituted cell fractions or whole cells, these activities include, 
but are not limited the rate at which sensorin performs any measurable 
biological characteristic and ail measurable consequences of these effects, 
including cell grov^h, development or behavior and other direct or indirect 

1 5 effects of sensorin activity. Relating to HSN2 genes and transcription, HSN2 
activity includes the rate, scale or scope of transcription of genomic DNA to 
generate RNA; the effect of regulatory proteins on such transcription, the 
effect of modulators of such regulatory proteins on such transcription; plus the 
stability and behavior of mRNA transcripts, post-transcription processing, 

20 mRNA amounts and turnover, and all measurernents of the rate or amount of 
expression and translation of the mRNA into polypeptide sequences. Relating 
to HSN2 activity or sensorin activity in organisms, this includes but is not 
limited biological activities which are identified by their absence or deficiency 
in disease processes or disorders caused by aberrant HSN2 or sensorin 

25 biological activity in those organisms. Broadly speaking, HSN2 biological 
activity or sensorin biological actlvrty can be detemiined by all these and other 
means for analyzing biological properties of proteins and genes that are 
known in the art. 

30 The invention therefore provides numerous assays which measure an 

activity of HSN2 or sensorin and are useful for the testing of chemical 
compounds to identiiy which ones affect such activity. 
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In one aspect, the present invention relates to a method for identifying 
an agent that modulates the activity of the HSN2 gene, comprising: 

a) contacting- a test compound with a cell that expresses an HSN2 

gene; 

5 b) detennining a change in the expression of said gene as a result of 

said contacting, 

wherein said detennined change in expression of the gene Indicates 
modulation, thereby identifying the test compound as an agent that modulates 
the activity of an HSN2 gene, 

10 

In a preferred embodiment of such method, HSN2 is a mammalian 
HSN2 gene, most preferably where the mammal is mouse, rat or human. Also 
preferred is where detennined modulation is a decrease in expression and/or 
where the gene is present in a cell, preferably a mammalian cell, preferably a 
1 5 nervous system cell, such as a neuron. Such cells may include a recombinant 
cell, especially one genetically engineered to contain and express the HSN2 
gene. In a preferred embodiment, said gene has the sequence of SEQ ID NO: 
1, 

20 In other embodiments, the gene comprises a polynucleotide 

corresponding to a polynucleotide having a nucleotide sequence selected 
from the group consisting of SEQ ID NO: 1, 7 and 9. 

In another aspect, the present Invention relates to a method for 
25 identifying an agent that modulates HSN2 gene, comprising: 

(a) contacting a test compound with a genetic conistruct comprising a 
reporter gene operably linked to an HSN2 promoter under conditions where 
the reporter gene is expressed; 

(b) determining a change in expression of the reporter gene as a result 
30 of said contacting, 

wherein a determined change in expression indicates modulation, 
thereby identifying the test compound as an agent that modulates HSN2 
gene. 
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In a preferred embodiment thereof, the modulation is a decrease in 
expression of said i-eporter gene. Nonnally the reporter gene is some readily 
measured gene and is not HSN2 but the HSN2 gene, with promoter 

5 sequence, may be used and still lie within the contemplation of the invention, 
even though the gene is not contained within a cell. Also preferred is where 
the HSN2 promoter Is a mammalian HSN2 promoter, especially where the 
mammal is mouse, rat or human. Most prefenred is where the promoter has 
the nucleotide sequence of SEQ ID NO: 6 and/or where the genetic construct 

10 is present In a cell, preferably a mammalian cell, such as. a nervous system 
cell, preferably a neuron. Of course, such construct will normally be present.ln 
a recombinant cell. 

The present invention also contemplates a method for identifying an 
1 5 agent that modulates the activity of an HSN2-encoded protein, comprising: 

a) contacting a test compound with an HSN2-encoded polypeptide 
under conditions where said polypeptide is active; and 

b) determining a change in the activity of said polypeptide as a result 
of said contacting; 

20 wherein said deterrtiined change in activity indicates modulation, 

thereby identifying the test compound as an agent that modulates the activity 
of HSN2-encoded protein (such as sensorin). 

In a preferred embodiment thereof, the HSN2-encoded polypeptide is a 
25 mammalian HSN2-encoded polypeptide, preferably where the mammal is 
mouse, rat or human. Also preferred is where the determined modulation is a 
decrease in activity of the polypeptide. 

Also preferred is where the polypeptide is present in a mammalian cell, 
30 especially where this has been engineered to contain or express said 
polypeptide, such as by genetic engineering. In one such embodiment, the 
recombinant cell does not express said polypeptide absent said engineering. 
Also preferred is where the cell is a cell of the nervous system, especially a 
neuron, most especially a neuron involved in pain, response. 
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Also preferred is where the polypeptide is encoded by a polynucleotide 
having a sequence selected from the group consisting of SEQ ID NO: 1, 7, 
and 9, especially where the polypeptide comprises an amino acid sequence of 
5 SEQ ID NO: 2, 8, 10 or 11. 

In a preferred embodiment, the observed change in activity in step (b) 
is a decrease In activity and is the result of binding to said polypeptide by said 
chemical agent of step (b): Also preferred is where the agents are useful for 

1 0 treating pain, neuropathy or a related disorder- 
In other preferred embodiments, the polypeptide is part of an Intact cell, 
and the present invention specifically contemplates embodiments in which the 
cell is engineered by other than genetic engineering, such as where the 

1 5 activity of a polypeptide is to be enhanced and the cell has been engineered 
to contain, or have on its surface, said polypeptide but wherein the 
polypeptide is present due to physical insertion of the polypeptide into the 
membrane or cytoplasm of the cell and not through expression of a gene 
contained in the cell. Methods well known in the art, such as use of 

20 polyethylene glycol, viruses, and the like, are available to effect such 
insertions and the details of such procedures need not be further described 
herein. 

In one preferred embodiment of such method, the polypeptide is a 
25 polypeptide that reacts with and/or binds to, an antibody that binds to, or is 
specific for, a polypeptide having an amino acid sequence at least 95% 
identical to, more preferably at least 98% identical to, the sequence of SEQ ID 
NO: 2 and where any difference in amino acid sequence is due only to 
conservative amino acid substitutions. In an especially preferred embodiment, 
30 the polypeptide has the amino acid sequence of SEQ ID NO: 2. 

The sensorin assays of the Invention may employ compound screening 
technology such as (but not limited to) the ability of various dyes to change 
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color in response to changes in assay conditions resulting from the activity of 
the polypeptides. Connpound screening assays can also be based upon the 
ability of test compounds to modulate the interaction of the target peptide 
(sensorin) and known interacting proteins. Such Interacting proteins can be 
5 identified by a variety of methods known in the art, including, for example, 
radioimmunoprecipitation. co-immunoprecipitation, co-purlficatlon, and yeast 
two-hybrid screening. Such interactions can be further assayed by means 
including but not limited to fluorescence polarization or scintillation proximity 
methods. 

10 

Agents that have the effect of modulate the half-life of sensorin in cejis 
would also act to induce decreased sensitivity to pain and thereby achieve 
analgesia. By way of non-limiting example, cells expressing a wild-type 
sensorin are transiently metabolically labeled during translation, contacted 
1 5 with a candidate compound, and the half-life of the polypeptide is determined 
using standard techniques. Compounds that modulate the half-life of the 
polypeptide are useful compounds in the present invention. 

In one such assay for which the polypeptides encoded by genes 
20 disclosed herein are useful, the polypeptide (or a polypeptide fragment thereof 
or an epitope-tagged form or fragment thereof) is bound to a suitable support 
(e.g., nitrocellulose or an antibody or a metal agarose column in the case of, 
for example, a his-tagged fomn of said polypeptide). Binding to the support is 
preferably done under conditions that ^llow proteins associated with the 
25 polypeptide to remain associated with it. Sucl^ conditions may include use of 
buffers that minimize Interference with protein-protein interactions. If desired, 
other proteins (e.g., a cell lysate) are gdded, and allowed time to associate 
with the polypeptide. The immobilized polypeptide is then washed to remove 
proteins or other cell constituents that may be non-specificaily associated with 
30 it the polypeptide or the support. The immobilized polypeptide can then be 
used for multiple purposes. In a compound screening embodiment, 
compounds can be tested for their ability to interfere with interactions between 
sensorin and other bound molecules (which are presumably sensorin 
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interacting proteins). Compounds which can successfully displace Interacting 
proteins are thereby identified as sensorin modulating agents of the Invention. 

In an alternative embodiment designed to identify sensorin interacting 

5 proteins, the immobilized polypeptide is dissociated from its support, and 
proteins bound to it are released (for example, by heating), or, alternatively, 
associated proteins are released from the polypeptide without releasing the 
latter polypeptide from the support The released proteins and other cell 
constituents can be analyzed, for example, by SDS-PAGE gel 

10 electrophoresis, Westem blotting and detection with specific antibodies, 
phospho-amino acid analysis, protease digestion, protein sequencing, or 
Isoelectric focusing. Nomnal and mutant fomns of such polypeptide can be 
employed in these assays to gain additional infomnation about which part of 
the polypeptide a given factor Is binding to. In addition, when incompletely 

1 5 purified polypeptide is employed, comparison of the nomnal and mutant fonns 
of the protein can be used to help distinguish true binding proteins. Such an 
assay can be performed using a purified or semipurified protein or other 
molecule that is known to Interact with a polypeptide encoded by an HSN2 
polynucleotide. 

20 This assay may include the. following steps. 

1 . Harvest the encoded polypeptide and couple a suitable fluorescent 
label to it; 

2. Label an interacting protein (or other molecule) with a second, 
different fluorescent label. Use dyes that will produce different quenching 

25 patterns when they are in close proximity to each other versus when they are 
physically separate (I.e., dyes that quench each other when they are close 
together but fluoresce when they are not in close prqxiniity); 

3. Expose the interacting molecule to th@ immobilized polypeptide in 
the presence or absence of a compound being tested for its ability to interfere 

30 with an interaction between the two; and 

4. Collect fluorescent readout data. 
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An alternative assay for such protein interaction is the Fluorescent 
Resonance Energy Transfer (FRET) assay. This assay can be performed as 
follows. 

1. Provide the encoded protein or a suitable polypeptide fragment 
5 thereof and couple a suitable FRET donor (e.g., nitro-benzoxadiazole (NBD)) 

to It; 

2. Label an Interacting protein (or other molecule) with a FRET 
acceptor (e.g., rhodamine); 

3. Expose the acceptor-labeled interacting molecule to the donor- 
1 0 labeled polypeptide in the presence or absence of a compound being tested 

for its ability to interfere with an interaction between the two; and 

4. Measure fluorescence resonance energy transfer. 

Quenching and FRET assays are related. Either one can be applied in 
15 a given case, depending on which pair qf fluorophores is used in the assay. 

Additionally, drug screening assays can also be based upon 
polypeptide functions deduced upon antisense interference with the gene 
function. Intracellular localization of pain-related polypeptides, or effects 

20 which occur upon a change in intracellular localization of such proteins, can 
also be used as an assay for drug screening. 

In accordance with the foregoing, the present invention provides the 
amino acid sequence of a protein, designated sensorin or HSN2 Polypeptide, 
that is found in neuronal cells (for example, SEQ.ID NO: 2 from humans) and 

25 which is associated with hereditary transmission of insensitivity to pain and 
neuropathy. In addition, several mutations, have been found in this sequence 
derived from individuals found to have such indifference to pain. Thus, agents 
that mimic the phenotypic effects of this mutations, such as aberrant protein 
structure and decreased, or absent, function represent candidate compounds 

30 for evaluation as therapeutic agents of the invention. 

Relating to expression assays, in one aspect the present invention 
relates to a method for identifying an agent that modulates the activity of a 
polynucleotide whose expression contributes to pain sensation, comprising: 
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a) contacting under physiological conditions a chemical agent with a 
polynucleotide corresponding to the promoter region of HSN2 (in a preferred 
embodiment, the promoter has the sequence of SEQ ID NO: 6); and 

b) detecting a change in the expression of said polynucleotide as a 
5 result of said contacting; 

thereby Identifying an agent that modulates said polynucleotide or gene 
activity. 

Such modulation is preferably a decrease or an increase in expression. 

10 In preferred embodiments, such expression is measured by measuring the 
amount of an expression product. In a convenient embodiment, the promoter 
region of HSN2 is operably linked to a reporter gene, that is, a gene which 
whose expression is conveniently measured (for example, reporter genes 
such as Green Fluorescent Protein, luciferase, chloramphenicol acetyl- 

1 5 transferase (CAT), and the like). 

In preferred embodiments, the polynucleotide whose expression is to 
be measured or monitored is present In an intact cell, preferably a mammalian 
cell, most preferably a neuronal cell. In additional preferred embodiments, 
20 such an intact cell is a cell that has been engineered to comprise said 
polynucleotide, such as by genetic engineering, most preferably wherein the 
cell does not express the subject gene or polynucleotide absent having been 
engineered to do so. 

25 In accordance with the disclosure herein, upstream untranslated 

regions and promoter regions of HSN2 are readily obtained from SEQ ID No. 
12 and other publicly retrievable sequence databases. Such genomic or 
untranslated regions may be included in plasmids comprising the identified 
gene, such as in assays to identily compounds which modulate expression 

30 thereof. In one such assay, the upstream genomic region is ligated to a 
reporter gene, and incorporated into an expression plasmid. The plasmid is 
transfected into a cell, and the recombinant cell is exposed to test 
compound(s). Those compounds which increase or decrease the expression 
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of the reporter gene are then modulators of the gene/protein, and are 
considered therapeutic agents of the invention. 

The invention also claims recombinant cells engineered to express a 
5 polynucleotide or polypeptide as disclosed herein. The gene disclosed herein 
as being involved in HSAN II in an animal can be used, or a fragment thereof 
can be used, as a tool to express a protein, where such genes encode a 
protein, in an appropriate cell in vitro, or can be cloned into expression vectors 
which can be used to produce large enough amounte of protein to use in in 

1 0 vitro assays for drug screening. Alternatively, the expression construct may 
employ the genomic promoter region of HSN2 and link it to a gene, such as a 
reporter gene, whose expression level is easily measured. Expression 
systems which may be employed indude baculovlrus, herpes virus, 
adenovirus, adeno-associated virus, bacterial systems, and eukaryotic 

1 5 systems such as CHO cells. Naked DNA and ONA-llposome complexes can 
also be used. The invention thus claims recombinant cell lines containing a 
heterologous HSN2 gene. 

For general molecular biology procedures useful In practicing the present 
20 invention, a number of standard references are available that contain 
procedures well known in the art of molecular biology and genetic engineering 
and which procedures need not be further described herein. Useful references 
include Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor, N.Y.. (1989), Wu et al, Mettiods in Gene 
25 Biotechnology (CRC Press, New York, NY, 1997), and Recombinant Gene 
Expression Protocols, in Methods in Molecular Biology, Vol. 62, (Tuan, ed., 
Humana Press, Totowa, NJ, 1997). the disclosures of which are hereby 
incorporated by reference. 

30 Such recombinant cells may be used in expression assays for 

analyzing the levels of expression of HSN2 or a suitable reporter gene after 
contacting said cells with agents that may have analgesic properties. The 
levels of gene expression can be quantified by Northem blot analysis or RT- 
PCR, or, alternatively, by measuring the amount of protein produced, by one 
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of a number of methods known in the art, or by measuring the levels of 
biological activity of polypeptides encoded thereby or other genes. In this way, 
the gene expression can serve as a marker, indicative of the physiological 
response of the cells to the agent Accordingly, this response state may be 
5 determined before, and at various points during, treatment of the individual 
with the agent 

Recombinant cell lines are also preferred for the preparation of purified 
protein, if a purified protein assay Is desired. Those skilled in the art are 
10 capable of producing recombinant cell lines and extracting protein fractions 
containing highly purified proteins. These samples can be used in a variety of 
binding assays to identify compounds which interact with the proteins. 
Compounds that interact are therapeutic agents of the Invention, or analogs 
thereof. 

15 

Target selectivity is an important aspect of the development of 
therapeutic agents. The present invention specifically contemplates the 
identification of chemical agents, especially small organic molecules, that 
inhibit the expression of HSN2. as defined herein, or the activity of the 
20 sensorin polypeptide (such as SEQ ID NO: 2 from humans) encoded thereby, 
with high specificity and that have little or no effect on other genes and/or 
polypeptides. 

Thus, in one such preferred embodirnent the methods disclosed herein 
25 for identifying an agent that modulates, preferably inhibits, expression of a 
gene corresponding to the HSAN gene, preferably having the sequence of 
SEQ ID NO: 1 from humans, or on the activity of a polypeptide encoded 
thereby, comprises first identifying such agent and then testing such agent for 
effects on expression or activity of at least one other gene or polypeptide, 
30 preferably a gene or polypeptide with important physiological consequences 
that are preferably not disturiaed by therapeutic inten^entlon, and 
demonstrating little or no effect. 



wo 2004/043999 




:T/CA2003/001716 



In another aspect, the invention provides a method for computationally 
identifying a compound of the invention. The method Involves (a) detemnining 
the active site of a-sensorin protein (i.e. through X-Ray crystallography or 
other techniques); and (b) through computational modeling, identifying a 

5 compound which interacts with th@ active site, thereby identifying a 
compound, or its analog, as a compound which is useful for modulating the 
activity of such a polypeptide. This process is sometimes referred to as in 
silico screening. Sophisticated software for testing the probability of test 
compounds to interact with the target protein, which can test tens of millions 

10 of computer generated compounds, is available to those skilled in the art. 

Potential therapeutic compounds identified using the methods of the 
invention are usually tested in animal model systems to confirm the putative 
efficacy. Thus, in a further aspect, the present invention relates to a method 
1 5 for Identifying an analgesic agent, comprising: 

(a) administering to an animal an agent found to modulate HSN2 gene 
or protein activity, and 

b) determining in said animal a decrease in response to a pain 
stimulus as a result of said administering, 
20 wherein a determined decrease in response to said pain stimulus 

indicates analgesic activity, thereby identifying said agent an analgesic agent. 

Preferably, the animal is a mammal, such as a human being. In specific 
embodiments, the pain stimulus is a heat stimulus and reaction or sensitivity 

25 to hot and/or cold may be measured. |n another eml^odinlent, an electrical 
stimulus may be used. In all cases, the stimulus may be represented as a 
sharp or dull sensation. In some cases, the animal may othenA/ise react 
normally to such stimulus so that a decrease in nonmal response due to the 
test agent is being measured whereas in other cases the animal may initially 

30 possess a heightened sensitivity to the stimulus prior to administering the test 
agent. In all cases, observation of an analgesic effect need not necessarily 
involve a reduced sensitivity or response to pain but may involve simply a 
reduced sensation of a particular stimulus. The analgesics identified by the 
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methods of the invention may induce general analgesia in an animal or may 
have more localized analgesic or anesthetic effects. 

In a further aspect, the present invention relates to a method for 
5 treating a condition in an animal afflicted with a source of chronic pain 
comprising administering to said animal an effective amount of an analgesic 
agent first identified by an assay method of the invention. Preferably, said 
animal is a human patient such as a patient afflicted with a chronic ailment, 
such as a cancerous condition. 

10 

The screening assays of the invention thus simplify the evaluation, 
identification and development of therapeutic agents for the treatment of pain, 
neuropathy and related disorders. 

15 The invention also includes antibodies and immuno-reactive 

substances which target, interact with or bind to sensorin or epitopes thereof. 
Polypeptides encoded by the polynucleotides disclosed herein can be used as 
an antigen to raise antibodies, including monoclonal antibodies. Such 
antibodies will be useful for a wide variety of purposes sudi as for therapeutic 

20 agents, in functional studies, in drug screening assays, in clinical trials and for 
diagnostics. 

For example, in drug screening assays or in clinical trials, the 
effectiveness of an agent determined by a scieening assay as described 

25 herein to increase or decrease gene expression, protein levels, or biological 
activity can be monitored using antibodies specific for sensorin. Altematively, 
the effectiveness of an agent detemnined by a screening assay to modulate 
expression of HSN2, as well as structurally and functionally related genes, 
including genes with high homology thereto, and including protein levels, or 

30 biological activity can be monitored in clinical trials of subjects exhibiting 
decreased altered gene expression, protein levels, or biological activity. In 
such clinical trials, the expression or activity of the genes or polypeptides 
disclosed herein and, preferably, other genes that have been implicated in. for 
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example, congenital resistance to pain stimuli, can be used to ascertain the 
effectiveness of a particular analgesic drug. 

in a preferred embodiment, tiie present invention provides a method for 
5 monitoring the effectiveness of treatment of a subject with an agent (e.g., an 
agonist, antagonist, peptidomimetic, protein, peptide, antibody, nucleic acid, 
small molecule, or other drug candidate identified by the screening assays 
described herein) including the steps of (i) determining that a patient exhibits 
discomfort due to a disease or disorder that causes some type of painful 
10 stimulus; (ii) administering an effective amount of an agent identified using 
one of the screening assays disclosed herein; (ill) ascertaining a reduction to 
pain or other stimuli following said administration and (vi) altering the 
administration of the agent to the subject accordingly. For example, increased 
administration of the agent may be desirable to. decrease the expression or 
1 5 activity of gene or encoded polypeptide, i.e., to increase the effectiveness of 
the agent. 

Where the patient is non-human, biopsy samples can be taken to show 
a decrease in gene expression, such as by measuring levels of protein, 

20 mRNA, or genomic DNA post-administration samples and comparing the level 
of expression or activity of said protein. mRNA. or genomic DNA in the pre- 
administration sample with that of the conresponding post administration 
sample or samples, thereby showing tiie effects of drug administration on one 
or more of the genes disclosed herein and concomitant reduction In pain 

25 response and/or sensitivity. 

Candidate modulators may be purified (or substantially purified) 
molecules or may be one component of a mixture of compounds (e.g., an 
extract or supernatant obtained from cells). In a mixed compound assay, gene 
30 expression is tested against progressively smaller subsets of the candidate 
compound pool (e.g., produced by standard purification techniques, e.g., 
HPLC or FPLC) until a single compound or minimal compound mixture is 
demonstrated to modulate gene or protein activity or expression In a manner 
having analgesic effects. 
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Specific compounds which will modulate HSN2 gene expression, gene 
transcript levels, and protein levels, in a cell include antisense nucleic acids, 
rit)ozymes and other nucleic add compositions, which specifically hybridize 
5 witti HSN2 (Including exons or Introns of such genes, promoters, 3'-tails, and 
the like). These specific compounds are compounds of the invention, and are 
useful for treating Vtie diseases discussed previously. Design and 
manufacturing of such compounds are well known to those skilled in the art. 

10 Specific compounds which modulate the activity of HSN2, more 

specifically of sensorin, include antibodies (polyclonal or monoclonal) which 
bind specifically to an epitope of said polypeptide. These antibody 
compositions are compounds of the invention, and are useful for treating the 
diseases previously discussed. Design and manufacturing of such 

1 5 compounds are well known to those skilled in the art. 

Specific compounds which modulate the activity of HSN2, or its 
encoded protein, the body Include gene therapy vectors comprising all or a 
part of the HSN2 gene sequence, or a mutant HSN2 sequence. As is well 
known to those skilled in the art, gene therapy allows the delivery of HSN2 in 
an organism to cells where it is taken up and expressed, thus changing the 
level or amount of sensorin protein In such ceil. These vectors thereby 
modulate the activity of HSN2 or sensorin in th^ body and are useful for the 
therapeutic indications disctosed herein. 

Specific compounds which modulate the activity of HSN2 or sensorin in 
the body Include small organic molecules. Such compounds may be naturally 
occurring, or they may be synthetic. Collections and combinatorial libraries of 
such compounds are widely available from commercial sources. As know to 
those skilled in the art, a screening assay, such as the assays disclosed in the 
instant specification, can be easily adapted to identify therapeutic agents 
which have the desired HSN2 or sensorin modulating ability. Agonists, 
antagonists, or mimetics found to be effective at reducing response to pain 
stimuli may be confirmed, as useful in animal models (for example, mice. 
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chimpanzees, etc.)- In other embodiments, treatment with a compound of the 
invention may be combined with other analgesics to achieve a combined, 
possibly even synergistic, effect. 

5 Purified or semi-purified HSN2-encoded protein, or sensorin, and any 

biochemically modified versions thereof, are themselves therapeutic agente of 
the invention. Recombinant or nonrrecombinant forms of these proteins or 
fragments can be administered to persons in need thereof for the treatment of 
disorders, such as disorder^ disclosed herein. Preferably, such agents are 
1 0 administered in a pharmaceutically acceptable carrier. Those skilled in the art 
are familiar with techniques for generating such agents, and for detemiining 
conditions of administration. 

Lead Optimization and Analog Development and Selection 

15 

In general, novel drugs having analgesic or anesthetic properties, and 
which modulate sensorin activity, are identified from libraries, possibly large 
libraries, of both natural product or synthetic (or semi-synthetic) extracte or 
chemical libraries according to methods known in the art. Those skilled in the 

20 fieki or drug discovery and development will understand that the precise 
source of test extracts or compounds is not critical to the screening 
procedure(s) of the invention.. Accordingly, virtually any number of chemical 
extracte or compounds can be screened using the exemplary methods 
described herein. Examples of such extracts or compounds include, but are 

25 not limited to, plant-, fungah prokaryotic- or animal-based extracts, 
fermentation broths, and synthetic compounds, as well as modification of 
existing compounds. Numerous methods are also available for generating 
random or directed synthesis (e.g.. semi-synthesis or total synthesis) of any 
number of chemical compounds, including, but not limited to, saccharide-, 

30 lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound 
libraries are commercially available from Brandon Associates (Merrimack, 
NH) and Aldrich Chemical (Milwaukee, Wl). Alternatively, libraries of natural 
compounds in the fomi of bacterial, fungal, plant, and animal extracts are 
commercially available from a number of sources, including Biotics (Sussex, 
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UK), Xenova (Slough, UK), Harbor Branch Oceangraphlcs Institute (Ft. 
Pierce, FL), and PharmaMar, U.S A (Cambridge. MA). In addition, natural 
and synthetically produced libraries are produced, if desired, according to 
methods known in the art:, e.g., by standard extraction and fractionation 
5 methods. Furthermore, if desired, any library or compound is readily modified 
using standard chemical, physical, or biochemical methods. 

De-replicatlon(e.g., taxonomte derepllcation, biological dereplication, 
and chemical dereplication, or any combination thereof) or the elimination of 
10 replicates or repeats of materials already known for their analgesic and/or 
anesthetic activities may be employed whenever possible. 

When a crude extract is found to have analgesic and/or anesthetic 
activities, or both, further fractionation of the positive lead extract is possible 

1 5 to isolate chemical constituent responsible for the observed effect. Thus, the 
goal of the extraction, fractionation, and purification process is the careful 
characterization and identification of a chemical entity within the crude extract 
having such analgesic and/or anesthetic activities. The same assays 
described herein for the detection of activities in mixtures of compounds can 

20 be used to purify the active component and to test derivatives thereof. 
Methods of fi-actionatlon and purificatfon of such heterogeneous extracts are 
known in the art If desired, compounds shown to be useful agents for the 
treatment of pathogenicity are chemically modified according to methods 
known in the art Compounds identified as being of therapeutic value are 

25 subsequently analyzed using any standard animal model of diabetes or 
obesity known in the art. 



In general, these screening methods provide a ready means for 
selecting either natural product extracts or synthetic compounds of interest 
from a large population (i.e. a. chemical library, for example, one produced by 
combinatorial means) which are further evaluated and condensed to a few 
active core structures. Multiple analogs of such core structures may be 
developed and tested to identify those preferred analogs which have 
improved characteristics as therapeutic agents. 
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Improved analogs may also Include compounds with improved stability, 
biodistrlbution, pharmacokinetics or other desirable features for therapeutic 
agents which are not directly related to modulation of the therapeutic target. 
5 In a preferred embodiment, the improved analog of the invention is effectively 
delivered, either by physiological means or assisted means, to cells of the 
body expressing the sensorin protein. Analog compounds are systematically 
screened to evaluate whether they modulate the identified biological activity 
and those that eftectively do so are then therapeutic agents, or further 
1 0 analogs thereof, according to the invention. 



Theraoeutic Agents and Uses Thereof 

The agents contemplated by the present invention are highly selective 
1 5 for the HSN2-encoded protein (sensorin) and administration of such an agent 
to a human or other animal in need thereof provides a treatment for any of the 
disorders exemplified by, or later found to be related to, HSAN II (and 
involving possible mutations of HSN2). For example, an inhibitor mimics the 
effects of one or more of the mutated fonns of the gene as disclosed herein. 
20 Alternately, an agonist would reverse a disorder made evident in HSAN II. 
HSN2 and/or sensorin may also function In the development, survival or 
maintenance of peripheral sensory neurons, or supporting cells and thus may 
be useful in identifying agents that act directly to further the life and activities 
of such cells. Genetic or phannacological down-regulation of this protein may 
25 lead to novel analgesics used to treat pain or other neurological symptoms, 
whereas up-regulation may lead to novel neuronal survival agents which may 
be useful in treating a number of neurodegenerative disorders of both the 
CNS and PNS. As such, the agent identified according to this invention is a 
treatment for pain, neuropathy or a related disorder. 

30 

Disorders which may be related to abenrant HSN2 or sensorin activity 
include pain, for example Inflammatory pain, a neuralgia, a nerve entrapment 
syndrome, and pain associated with a musculoskeletal disorder; 
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neuropathies, for example peripheral, diabetic, autonomic, sensory or motor 
neuropathy; control of sweating; autonomic dysfunction such as control of 
blood-pressure, swallo>Aring, posture, hypertension, hypotension, Gl - 
esophageal dysmotility, impotence, apnea, absence of lacrimation; Riley-Day 
5 Syndrome, temperature control, hypertonia; delayed gastric emptying, 
diarrhea, vomiting, neurological disorders such as seizure, migraine, tremor, 
anxiety, depression, hyperphagia and associated obesity, bipolar disorder, 
Parkinson's, Huntington's, spinal cord injury, multiple sclerosis, traumatic 
brain injury, stroke, neurodegenerative disorders such as Alzheimer's Disease 
1 0 and ALS, and the like. 

A variety of diseases are also treatable based on sensorin's possible 
role as a novel neurotrophic growth factor in the nervous system. Sensorin's 
amino acid sequence is about twice the length of most other known 
15 neurotrophic factors, and may represent a proform that is proteolytically 
cleaved at positions other than at the signal peptide cleavage site. Like other 
neurotrophic factors, sensorin may be expressed at low levels. 

Neurotrophic factors are essential for the development of the nervous 
20 system. They exert a diverse set of influences including neuronal cell survival, 
axonal and dendritic growth and guidance, synaptic structure and 
connections, neurotransmitter release, long-term potentiation and synaptic 
plasticity. Alterations in neurotrophic factor levels can influence cell survival 
and apoptosis during development, myelination, regeneration, pain, 
25 aggression, depression, substance abuse, anxiety, hyperphagia, memory 
acquisition and retention and loss of the peripheral nervous system. It is thus 
suggested that administration of sensorin protein, or fragments thereof, or 
modulators of sensorin or HSN2 activity are useiful for the treatment of these 
diseases. 

30 

Those skilled in the art are familiar with the necessary steps for pre* 
clinical and human clinical trials which are used to establish efficacy and 
safety of the new chemical entities and compounds first identified by the 
invention for use in treating the diseases mentioned herein. 
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Compounds first identified as useful In reducing sensitivity to pain 
stimuli using one or more of the assays of the invention may be administered 
with a pharmaceutlcally-acceptable diluent, can-ier, or excipient, in unit 
dosage form. Conventional phamiaceutical practice may be employed to 
5 provide suitable formulations or compositions to administer such compositions 
to patients. Alttiough oral administration is prefenred, any appropriate route 
of administration may be employed, for example, Intravenous, parenteral, 
subcutaneous. Intramuscular, Intracranial. Intraorbital, ophthalmic, 
intraventricular, intracapsular, intraspinal, intratiiecal, epidural, Intracistemal, 
1 o intraperitoneal, intranasal, or aerosol administration. Therapeutic fomiulations 
may be in the fcrni of liquid solutions or suspension; for oral administration, 
formulations may be in the form of tablets or capsules; and for inti-anasal 
fomnulations, in tiie fomi of powders, nasal drops, or aerosols. 

1 5 Methods well known in the art for making fomnulations are found in, for 

example. Remington: The Science and Practic e of Pharmacy. (19th ed.) ed. 
A.R. Gennaro AR., 1995. Mack Publishing Company, Easton, PA. 
Formulations for parenteral administration may, for example, contain 
exclpients, sterile water, or saline, polyalkylene glycols such as polyethylene 

20 glycol, oils of vegetable origin, or hydrogenated napthalenes. Biocompatible, 
biodegradable lactide polymer, lactide/glycollde copolymer, or 
polyoxyethylene-polyoxypropylene copolymers may be used to control the 
release of the compounds. Ottier potentially useful parenteral delivery 
systems for agonists of tiie Invention include ethylenevinyl acetate copolymer 

25 particles, osmotic pumps, Implantable Infusion s^tems, and liposomes. 
Fonnulations for inhalation may contain exclpients, or example, lactose, or 
may be aqueous solutions containing, for example, polyoxyethylene-9-laur^ 
ether, glycocholate and deoxycholate, or may be oiJy solutions for 
administration in the form of nasal drops, or as a gel.. 

30 

Combination therapies are also contemplated by the inventors. An 
analgesic agent identified by one of the screening methods disclosed herein 
may be administered along with another agent intended to treat a coincident 
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conditions, such as where analgesic and antitumor agents are given together 
or contemporaneously. 

The present invention also relates to a process that comprises a 

5 method for producing a product, such as by generating test data to facilitate 
identification of such product, comprising identifying an agent according to 
one of the disclosed processes for identifying such an agent (i.e., the 
therapeutic agents identified according to the assay procedures disclosed 
herein) wherein said product Is the data collected with respect to said agent 

10 as a result of said identification process, or assay, and wherein said data is 
sufficient to convey the chemical character and/or structure and/or properties 
of said agent. For example, the present Invention specifically contemplates a 
situation whereby a user of an assay of the invention may use the assay to 
screen for compounds having the desired enzyme modulating activity and, 

15 having identified the compound, then conveys that information (i.e., 
information as to stmcture, dosage, etc) to another user who then utilizes the 
information to reproduce the agent and administer it for therapeutic or 
research purposes according to the invention. For example, the user of the 
assay (user 1) may screen a number of test compounds without knowing the 

20 structure or identity of the compounds (such as where a number of code 
numbers are used the first user is simply given samples labeled with said 
code numbers) and, after, performing the screening process, using one or 
more assay processes of the present invention, then imparts to a second user 
(user 2), verbally or in writing or some equivalent fashion, sufficient 

25 information to identify the compounds having a particular modulating activity 
(for example, the code number with the corresponding results). This 
transmission of infonnation from user 1 to user 2 is specifically contemplated 
by the present invention. 

30 in accordance with the foregoing, the present invention encompasses a 

method for producing test data with respect to the gene modulating activity of 
a compound comprising: 

(a) contacting a test compound with a HSN2 gene under conditions 
where said gene is being expressed; 
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b) determining a change in the expression of said gene as a result of 

said contacting, and 

(c) producing "test data with respect to the gene modulating activity of 
said test compound based on a change in the expression of the determined 
5 gene as a result of said contacting. 

Diaonostics and Phamnacoaenomics 

In a further embodiment; the invention relates to diagnostic and 
10 pharmacogenomic compounds, kits and methods. This aspect relates to 
analysis HSN2 for the diagnosis of insensitivity or indifference to pain, other 
pain disorder, or in the selection of a therapeutic agent for a patient (i.e. 
pharmacogenomics). 

1 5 For example, nucleic acid analysis can be used to identify the HSN2 

mutations disclosed herein, thus confimning the diagnosis of HSAN 11. Many 
nucleic acid diagnostic techniques are well known to those skilled in the art. 
Such techniques include DNA sequencing, hybridization probing, single 
stranded conformational analysis, PGR based techniques such as mismatch 

20 amplification, and myriad other well known methods. All such analysis can be 
performed on a small sample of blood, saliva, urine or other tissue provided 
by the patient. 

Alternatively, protein based analyses such as antibody based assays 
25 (Elisa, Radioimmunoassay and the like) can be employed to identify the 
expression, amount or presence or absence of a mutant protein (such as 
sensorin encoded by a mutant HSN2), such as those disclosed herein. 

Gene expression, both cornparable and absolute, as well as biological 
30 activity, and mutational analysis can each serve as a diagnostic tool for pain 
or neuropathic disorders; thus detemnination of the amount of HSN2 mRNA 
can be used to diagnose the presence or absence of a mutation correlated 
with such pain or neuropathic disorder. 
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Pharmacogenomics deals with clinically significant hereditary variations 
in the response to drugs due to altered drug disposition and abnormal action 
in affected persons (Eichelbaum, M., Clin. Exp. Phamnacoi. Physiol., 23:983- 
985, 1996; Under, M. W„ Clin. Chem., 43:254-266, 1997), In general, two 
5 types of phannacogenetic conditions can be differentiated. Genetic conditions 
transmitted as a single factor altering the way drugs act on the body (altered 
drug action) or genetic conditions transmitted as single factors altering the 
way the body acts on drugs (altered drug metabolism). Altered drug action 
may occur in a patient having a polymorphism (e.g.. an single nucleotide 
10 polymorphism or SNP) in promoter, intronic, or exonic sequences of HSN2. 
Thus by determining the presence and prevalence of polymorphisms in HSN2 
in an Individual, one may predict a patient's response to a particular 
therapeutic agent. 

15 This phamiacogenomic analysis can lead to the tailoring of drug 

treatments according to patient genotype, including prediction of side effects 
upon administration of therapeutic agents, particularly therapeutic agents for 
treating disorders disclosed in this specification. Pharmacogenomics allows 
for the selection of agents (e.g., drugs) for therapeutic or prophylactic 

20 treatment of an individual based on the genotype of the individual (e.g., the 
genotype of the individual is examined to determine the ability of the individual 
to respond to a particular agent). 

Diagnostics employing a gene or protein corresponding to HSN2 can 
25 also be useful in selecting patients for clinical trials of a potential therapeutic 
agent. Patients can be stratified according to the DNA or protein sequence of 
HSN2 and their response to drug treatment can be evaluated. Such 
stratification can greatly reduce the number of patients required to establish 
efficacy for a potential therapeutic agent. 



30 
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Example 1 

Identification of the genetic mutation responsible for Hereditary Sensory 
and Autonomic Neuropathy - Type II (HSAN II). 

5 

We perfomied a genomewide screen in two geographically isolated 
families, who are likely related, to map the HSAN II locus. We expanded a 
consanguineous multigenerational family with eight affecteds (HSAN4) from a 

10 sibship reported previously and collected a smaller family with two affecteds 
(HSAN3). Most of the patients live within a 100-mile radius in a geographically 
isolated region. In both families the mode of inheritance was autosomal 
recessive and affected members presented with severe and early-onset 
HSAN II. Sensory loss was predominantly distal, but the progression of the 

1 5 disorder varied, involving the trunk in some patients. Because of the common 
geographic origin of our subjects and the relationships established through 
^mily history, it became clear that the different degrees of sensory loss were 
part of a continuum of a single disease process, and both families were 
expected to segregate the identical mutation. Pedigrees are shown in Fig 1. 

20 Additionally, unrelated affected samples from a different population were 
available, and these consisted of two affected sisters (HSAN5-301 and 
HSAN5-302) and another affected individual with no close femily ties to the 
affected sisters (HSAN6-301). 

25 Fifteen family members from HSAN4 and seven family members firom 

HSAN3 were genotyped at 763 autosomal markers and at 48 X markers. A 
genome-wide two-point analysis identified five regions with cumulative two- 
point LOD scores >2.0. Only one region contained a marker that was 
homo2ygous in the three affected siblings in HSAN4 (patients 70-72) and the 

30 two affected siblings in HSAN3 (patients 31-42), This marker, D12S352, with 
a positive cumulative LOD score of 2.64 at zero recombination, suggested 
that HSAN type II likely maps to chromosome 12p13. In addition, this was the 
only genomic segment where a three-marker haplotype was shared between 
all affecteds. Analysis of additional microsatellite markers demonstrated that 
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all affected individuals in both families were homozygous for each marker in a 
1.2 Mb common haplotype. Marker order was consistent between the 
Genethon and MarshfieW genetic maps and the Build 30 human genome 
sequence assembly, suggesting that there were no gross inconsistencies or 
5 recombination hotspots. 22 markers gave significant positive cumulative LOD 
scores, the highest one being 8.4 for marker CA1AC005343 at zero 
recombination. 

The core region of shared marker alleles (see Table 1) places the 
10 HSAN 11 locus within a region on chromosome 12p13 between markers 
CA1AC0021054 (forward primer. 5'-ACCATCACCTAAGGAGACAGACC 
(SEQ ID NO: 30) and reverse primer: 5-TGCAACAAATGTACCACTCTGG 
(SEQ ID NO: 31)) and D12S1642 (forward primer: 5'-AGCTCCTAAATCCCCG 
(SEQ ID NO: 32), and reverse primer: 5'-GCCATGTCTATAAATACCCTG 
1 5 (SEQ ID NO: 33)) based on recombination breakpoints in individuals HSAN4- 
171 and HSAN4-124. 

We also screened the additional affecteds from a different population. 
Individual HSAN6-301 was homozygous for all markers in the region defined 

20 in the HSAN4 family. The two affected sisters. HSAN5-301 and HSAN5-302, 
were not homozygous in the 12p13 region, and instead shared alleles on one 
chromosome with individual HSAN6-301. Alleles on the other chromosome 
are shared between the two sisters, suggesting that they are compound 
heterozygotes and that there are two founder mutations in this population. A 

25 recombination in one of the HSAN5 sisters positions the linked region above 
marker CA1AC005183x2. 

The genomic sequence for the region defined in family HSAN4 is well 
defined, with a BAQ contig consisting of 12 completely finished and one 
30 incomplete BAG sequences. The syntonic region in mouse is ~900 kb in size 
and located on chr 6. The gene order is conserved between human and 
mouse. 
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Candidate genes: A list of the genes mapping to this candidate interval 
was compiled based on the Apr2002 freeze of the human genome assembly 
available from UCSC. A total of 13 known or predicted transcripts are located 
between markers D12S352 and D12S1615. These included NINJ2 
5 (NM_016533). FLJ31553 (AK056115). PRKWNK1 (NM_018979). RAD52 
(NM_002879). ELKS (NM_015064), PPA (BM714852). WNT5B 
(NM_030775). FLJ21432 (BM453101). CACNA2D4 (AL137658). SLIT2L 
(BI754042), FLJ25180 (AK057909), FU31638 (AK056200) and CACNA1C 
(NM_000719). The transcripts were categorized according to their evidence 
1 0 for being real genes, according to the following scheme: 1 - confirmed mRNA; 
2 - mRNA supported by ESTs; 3 - spliced ESTs with other evidence to 
support; 4 - single spliced EST; 6 - unspllced EST; 6 - pseudogene not 
expressed. Many unspliced ESTs were found, and no serious effort was taken 
to label these as transcripts, given the known contamination of genomic DNA 
1 5 or unspliced intronic contamination within the cDNA libraries used to generate 
dbEST, Also, gene predictions were generally ignored unless there was 
independent evidence from mRNA sequence, EST, or non-human gene in 
which case the mRNA was used as transcript information. 

20 Sequencing of the coding regions of these Initial 13 candidate genes 

did not klentily any relevant mutations. A more exhaustive search for 
candidate genes was therefore conducted based on conserved homologies 
between human and mouse genomic assemblies. The human genomic 
sequence representing the entire HSAN II candidate inten/al (1.2 Mb) was 

25 downloaded from the Human Genome Browser website (genome.ucsc.edu/) 
with case toggled to highlight the mouse translated BLAT track (which 
represents regions showing significant protein homology between human and 
mouse). A similar human sequence showing the translated Tetraodon BLAT 
track was also downloaded. These sequences were assembled into a contig, 

30 to which all exons of previously identified candidate genes were added. Then 
the contig was scanned for novel conserved fragments, roughly >80% 
conserved over >100 bp. This identified 64 novel fragments, which were then 
tested for functional homolgy using: 1) BLASTn against the nr database, 2) 
BLASTn against the dbEST, and 3) BLASTx against the nr .database available 
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from the NCBl website (at www.ncbi.nlm.nih.gov/BLAST/). The HSN2 gene 
contained the only novel well-conserved ORF identified in this screen. 

Assembly of HSN2 cDNA contig and transcripts: A total of 21 human EST 
5 sequences (from 8 different cDNA libraries), including BG428619. BF522762. 
and N98701, and a genomic sequence firom BAG AC004765, were 
assembled into a contig. A putative cDNA sequence was derived from the 
genomic sequence, with ends based on human. ESTs (SEQ ID NO. 1). The 
start ATG is immediately preceded by a stop TGA codon. and there is no 
10 further indication of a conserved splicing site upstream of the ORF, 
suggesting that this gene consists of a single unspliced exon. The identified 
open reading frame (ORF) encoding a peptide of 434 aa (SEQ ID NO. 2) was 
used to identify the mouse, rat, and fish ortiiologs from their respective 
genome assemblies. A consensus genomic sequence was downloaded from 
1 5 the UCSC website (SEQ ID NO. 12). The HSN2 gene maps to intron 8 of the 
PRKWNKI gene, and consists of a single exon that is transcribed from the 
same strand as PRKWNKI . 

To screen the ORF for nucleic acid mutations, three separate 
20 amplicons were designed (XH1 8x01 a.. XH1 8x01b, and XH18x01c). PGR 
primers (XGR0720 to XGR0725) were designed using PrimerSelect 
(DNASTAR, Madison Wl) and purchased from BioGorp (Montreal). The primer 
sequences, and amplicon sizes are given in Table 2. The TD1 protocol for 
PGR amplification consisted of the following: initial denaturation for 5 min. at 
25 94G, followed by 17 cycles of denaturation at 95°G for 30 sec, annealing at a 
temperature starting at 70°G and ending at 54»G (-1"G per cycle), and 45 sec 
elongation at 72G, followed by 25 cycles of denaturation at 95^ for 30 sec., 
annealing at 54*>G for 30 sec. and elongation at 72°G for 45 sec. followed by 
a single cycle at 72^G for 5 min. 
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Table 2. 



Ampllcon 


Forward primer 


Reverse primer 


Size (bp) 


Protocol 


XH18x01a 


XGR0720: (SEQIDNO:24) 
TTCCAGAAGCAfrGTTATnATTT 


XGR0721: (SEQ ID NO: 25) 
CCCCCTTGTACTGGCTTCT 


677 


TD1 


XHIdxOlb 


XGR0722: (SEQ ID NO: 26) 
CACCAGAGGCCGTAGTTATGTTG 


XGR0723: (SEQ ID NO: 27) 
TTGAGGAGGCAGTTCTTCTTGATT 


638 


TD1 


XHISxOlc 


XGR0724: (SEQ ID NO: 28) 
GCGCCTGCTGTGTTAACTCATAA 


XGR0725: (SEQ ID Na 29) 
CCAAAGATGGGGAAACTCTACTGA 


669 


TD1 



Samples were ampi'ified using Taq polymerase (Qiagen) on a PE 9700 
thermocycler. All amplified fragmente were tested by agarose gel 
5 electrophoresis. Samples tested were HSAN4-70 (affected homozygous), 
HSAN4-371 (nonmal control from same pedigree) and HSAN5-302 (affected). 
Amplified products and sequencing primers were sent to the Montreal 
Genome Centre sequencing facility for sequence determination. Traces were 
then aligned using a genomic sequence contig constructed in SeqManll 
1 0 (DNASTAR), and variations identified and annotated to an Excel spreadsheet. 
Publicly available predicted SNPs (from NCBI dbSNP) were identified, or 
those found in the cDNA contig were noted. Sequence variants were 
documented by printing the screen image in MS PhotoEditor. selecting the 
relevant sequence trace and pasting into MSWord to make a Figure. 

15 

Identification of mutations in HSN2: Three separate mutations were 
identified that were predicted to gravely affect the ORF of the HSN2 gene 
(see Table 3, SEQ ID NO: 3 to.5). A deletion of an A at position 594 (relative 
to the start ATG), found homozygous jn the HSAN4*70 sample, is predicted to 

20 cause a frameshift in codon 198 leading to premature truncation of a 206 aa 
peptide. Two different mutations were found in the HSAN5-302 sample. The 
first is a 1 bp insertion of an A between bases 918-919, causing a frameshift 
in codon 307, and leading to premature truncation of a 318 aa peptide. The 
second is a C>T change at position 943 that changes codon 315 (CAG, 

25 encoding Glutamine) to a TAG stop codon, therefore prematurely truncating 
the protein at 314 amino acids. Each of these mutations was not found in the 
control sample, dbSNP database, nor available genomic and EST sequences. 
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Table 3. Sequence ID Numbers 



SEQ ID NO 


Description 


1 


Wild-Tvpe Human HSN2 putative cDNA (ends based on human ESTs) 


o 


Wild-Type Human sensor! n 




Mutation 1 - Amino acid sequence of truncated human sensorin in HSAN4- 
70 


4 


Mutation 2 - Amino acid sequence of truncated human sensorin in HSAN4- 
302 


5 


Mutation 3 - Amino acid sequence of truncated human sensorin in HSAN4- 
302 


6 


Genomic promoter sequence for human HSN2 


7 


Wild-Type Mouse HSN2 cDNA predicted (5' and 3' end not determined) 


8 


Wild-Type Mouse sensorin amino acid sequence. 433 amino acids 


9 


Wild-type Rat HSN2 nucleic acid sequence, (genomic sequence defined by 
5* and 3' ends in EST BF522762/AI578184. contains polyA tail) 


10 


Wild-type rat sensorin amino acid sequence. 434 amino acids 


11 


Wild-type Fugu sensorin amino acid sequence 


12 


Genomic structure and sequence of the human HSN2 


13 


Partial wild-type amino acid sequence of Tetraodon sensorin 


14 


Human cDNA; mutation 1. c.594delA 


15 


Human cDNA: mutat'on 2, c.918insA 


16 


Human cDNA; mutation 3, c.943C>T 


17 


Pig HSN2 Nucleotide Sequence 


18 


Pig HSN2 Amino Acid Sequence 


18 • 


Human HSN2 translated from 2"° ATG 


20 


Human HSN2 translated from 3"" ATG 


21 


Mouse HSN2 translated from 2"° ATG 


22 


Mouse HSN2 translated from 3"^ ATG 


23 


Rat HSN2 translated from 2"° ATG 



5 SEQ ID NO: 17 is the complete sequence of pig (Sus scrofa) cDNA 

clone, which was determined from clone MI-P-AY1-nrg-c-02-0-UI generated 
by Pig Genome Project and partially sequenced with Genbank accession 
number BI399422. SEQ ID NO: 18 is the sequence for the pig HSN 
polypeptide. 

10 

Analysis of SEQ ID NO: 19 to 23 shows that the 3rd ATG is closest to 
the optimal Kozak consensus C-C-[AG1-C-C-A-T-G-G context for Initiation of 
translation, but Is not conserved in rat Hsn2 genomic sequence. 
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WHAT IS CLAIMED IS: 

1. A method for identifying an agent that modulates the activity of the 
HSN2 gene, comprising: 

5 a) contacting a test compound with a cell that expresses an HSN2 

gene; 

b) determining a change in the expression of said gene as a result of 
said contacting, 

wherein said determined change in expression of the gene indicates 
1 0 modulation, thereby identifying the test compound as an agent that modulates 
the activity of an HSN2 gene. 

2. The method of claim 1 wherein said HSN2 gene is a mammalian 
HSN2I gene. 

15 

3. The method of claim 2 wherein the mammal is mouse, rat or human. 

4. The method of claim 1 wherein said modulation is a decrease in 
expression. 

20 

5. The method of claim 1 wherein said cell is a mammalian cell. 

6. The method of claim 5 wherein said cell is a recombinant cell. 

25 7. The method of claim 5 wherein said cell is a cell of the nervous 

system. 

8. The method of claim 1 wherein said gene comprises a 
polynucleotide corresponding to a polynucleotide having a nucleotide 

30 sequence selected from the group consisting of SEQ ID NO: 1, 7 and 9. 

9. A method for identifying an agent that modulates HSN2 gene, 
comprising: 
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(a) contacting a test compound with a genetic construct comprising a 
reporter gene operabiy linked to an HSN2 promoter under conditions wliere 
the reporter gene is expressed; 

(b) determining a change in expression of the reporter gene as a result 
5 of said contacting , 

wherein a determined change in expression indicates modulation, 
thereby identifying the test compound as an agent that modulates HSN2 
gene. 

10 10. The method of claim 9 wherein said modulation is a decrease in 

expression of said reporter gene. 

11. The method of claim 9 wherein said HSN2 promoter is a 
mammalian HSN2 promoter. 

15 

12. The method of claim 11 wherein the mammal is mouse, rat or 
human. 

13. The method of claim 9 wherein said promoter is the promoter of 
20 SEQ ID NO: 6. 

14. The method of claim 9 wherein said genetic construct is present in 

a cell. 

25 15. The method of claim 9 wherein said cell is a mammalian cell. 

16. The method of claim 9 wherein said cell is a recombinant cell. 

17. The method of claim 15 wherein said cell is a nervous system cell. 

30 

18. A method for identifying an agent that modulates the activity of an 
HSN2-encoded protein, comprising: 

a) contacting a test compound with an HSN2-encoded polypeptide 
under conditions where said polypeptide is active; and 
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b) determining a cliange in the activity of said polypeptide as a result 

of said contacting; 

wherein said -determined change in activity Indicates modulation, 
thereby identifying the test compound as an agent that modulates the activity 
5 of HSN2-encoded protein. 

19. The method of claim 18 wherein said HSN2-encoded protein is a 
mammalian HSN2-encoded polypeptide. 

10 20. The method of claim 19 wherein the mammal is mouse, rat or 

human. 

21. The method of claim 18 wherein said modulation is a decrease in 
activity. 

15 

22. The method of claim 18 wherein said protein is present In a 
mammalian cell. 

23. The method of claim 22 wherein said cell has been engineered to 
20 contain said protein. 

24. The method of claim 23 wherein said cell was engineered by 
genetic engineering. 

25 25. The method of claim 23 wherein said cell does not contain said 

protein absent said engineering. 

26. The method of claim 22 wherein said cell is a cell of the nervous 
system. 

30 

27. The method of claim 22 wherein said protein Is encoded by a 
polynucleotide having a sequence selected fipom the group consisting of SEQ 
ID NO: 1.7, and 9. 
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28. The method of claim 22 wherein said protein comprises an amino 
acid sequence of SEQ ID NO: 2, 8, 10 or 1 1, 

29. A method of treating a pain-related disorder comprising 
5 administering to an animal in need thereof a therapeutically effective amount 

of an HSN2 modulator. 

30. The method of claim 29 wherein said HSN2 modulator exhibits 
modulating activity in an assay method of one of claims 1 to 28. 

10 

31. The method of claim 29 wherein said agent was first Identified as 
an HSN2 modulator using an assay method of one of claims 1 to 28. 

32. A method for treating an HSN2-related disorder comprising 
1 5 administering to a person in need thereof an effective amount of a selective 

sensorin agonist or antagonist, or a pharmaceutically acceptable salt thereof, 
or a pharmaceutical composition containing said agonist or antagonist. 

33. The method according to claim 32 wherein said disorder is pain, 
20 neuropathy or a related disorder. 

34. The method of claim 33, wherein said pain disorder is selected 
from the group consisting of inflammatory pain, a neuralgia, a nerve 
entrapment syndrome, and pain associated with a musculoskeletal disorder. 

25 

35. A method of diagnosing the presence of, or risk of developing, an 
HSN2-related disorder comprising determining the presence of a mutation in 
the nucleic acid sequence of the HSN2 gene of said individual or in the amino 
acid sequence of an HSN2-encoded protein of said individual. 

30 

36. A method for identifying an agent that modulates HSN2 gene 
activity or sensorin activity, comprising: (a) contacting a test compound with a 
cell expressing HSN2; and (b) determining a change in expression of a HSN2 
nucleic acid or the activity of sensorin, wherein said, change indicates 
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modulation thereby identifying the test compound as an agent that modulates 
HSN2 gene activity or sensorin activity. 

37. The method of claim 36, wherein the test compound is a small 
5 molecule. 

38. The method of claim 36. wherein the test compound is an anti- 
sensorin antibody. 

10 39. The method of claim 36, wherein the test compound Is an 

antisense HSN2 nucleic acid molecule. 

40. The method of claim 36, wherein the test compound is a HSN2 
ribozyme. 

15 

41. An isolated polynucleotide comprising a nucleotide sequence with 
at least 60% identity to a sequence selected from the group consisting of SEQ 
ID NO: 1, 6, 7, 9 and 12 wherein said isolated polynucleotide encodes a 
polypeptide that binds to an antibody specific for a polypeptide having the 

20 amino acid sequence of SEQ ID NO: 2. 

42. The isolated polynucleotide of claim 41 wherein said identity is at 
least 70%. 

25 43. The isolated polynucleotide of claim 41 wherein said identity is at 

least 78%. 

44. The isolated polynucleotide of claim 41 wherein said identity is at 
least 90%. 

30 

45. The isolated polynucleotide of claim 41 wherein said identity is at 
least 95%. 
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46. The isolated polynucleotide of claim 41 wherein said identity is at 
least 98%. 

47. The isolated polynucleotide of claim 41 wherein said polynucleotide 
5 has a sequence selected from the group consisting of SEQ ID NO: 1, 6, 7, 9 

and 12. 

48. An isolated polypeptide comprising an amino acid sequence with at 
least 90% identity to an amino acid sequence selected from the group 

0 consisting of SEQ ID NO: 2, 8. 10. 11 and 13 wherein said polypeptide binds 
to an antibody specific for a polypeptide having the amino acid sequence of 
SEQ ID NO: 2.. 

49. The isolated polypeptide of claim 48 wherein said percent identity is 
5 at least 95%. 

50. The isolated polypeptide of claim 48 wherein said percent identity is 
at least 98%. 

20 51. An Isolated polypeptide comprising a polypeptide having an amino 

acid sequence selected from the group consisting of SEQ ID NO: 2, 3, 4, 5, 8, 
10, 11 and 13. 

52. A method for identifying an analgesic agent, comprising: 

25 a) administering to an animal an agent found to modulate HSN2 gene 

or HSN2-encoded polypeptide activity, and 

b) determining in said animal a decrease in response to a pain 
stimulus as a result of said administering, 

wherein a decrease in response to said pain stimulus indicates 
30 analgesic activity, thereby identifying said agent as an analgesic agent. 

53. The method of claim 52 wherein said animal is a mammal. 
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54. The method of claim 53 wherein said mammal is a mouse, a rat or 
a human being. 

55. A method for producing test data with respect to the gene 
5 modulating activity of a compound comprising: 

(a) contacting a test compound with a HSN2 gene under conditions 
where said gene is being expressed; 

b) determining a change In the expression of said gene as a result of 
said contacting, and 

10 (c) producing test data with respect to the gene modulating activity of 

said test compound based on a change in the expression of the determined 
gene as a result of said contacting. 
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Alignment with human hsn2pep: 



pEST€rame+ 
human 



pESTf rame-f- 
human 



pESTf rame+ 
human 



pESTf rame+ 
human 



pESTf rame-l- 
hximan 



pESTf rame+ 
human 



pESTf rame+ 
human 



pESTf ram©+ 
human 



pESTf rame+ 
human 



20 

M 



40 



MYE LL VL F MLI Q PQ SM AH P |gGT PT Y P E S QI F F PT I HE R P VS F S P P PT C P 
MYE LLVLFMLI Q PQ S MAH P§GGT P TYPE SQIFF PTIHERPVSF S PPPTCP 



MYELIiVIiPMLIQPQSMAHP GGTPTYPESQIFF PTIHERPVSF SPPPTCP 

60 * 80 * 100 



P?rv^AISQRRP:STSFLEAgTHHFgPLLRTVGQiLLPPGGi^PTNWTPEA\A-a^^^ 
P K VAX S Q R R K S T S F LE AQT HHF Q PLL RT VGqI LLP PGiGHPTWrriTT PE Av^-m 



PKVAISQRRKSTSFLEAqa?HHFQPIiLRTVGQ LIiPPGG PTNWTPBAWM 
120 . * 140 * 



PLFE PT(jv i^D^RPGLVLPEEAHYFI POEAW 
CE X 0 v^PMF E P S QW^dSr PGLVL PE E A.HYF X PQE AW 



LGTTASRVTGE CEIQV P6FEP30VY D RPGLVLPEEAHYFIPQEAVY 





GTTASRVT 


GE 


L^ 


GTTASRVT 


GE 



160 




180 * 200 



VL S S PHKQX PE QK PVQGGP^S S S VF 
vLSSPMKQl 9EQK PVQGGPHS S S VF 



VAGVHYQ 6AEQ5EGIPYNS VLSdPMKQIPEQKPVQGGP SSSVFEFP 



220 



240 



SGQAFLVGHLQMLRLDSGL||pGS PLS 31 SppX STDATRLKFHPVFVPHSA 
SGQAF LVGHLQMLRLDSGlSpGS PLS S X S^PX S T DAT RLKF H PVF VPH S A 



SGQAFLVGHLQNLRIiDSGL PGS PJJ.S.SI S. PISTDATRLKFHPVFVPHSA 

260 * 280 " ' ★ • 3D0 



P A.VLT HMI-JE S R SMC VF E F HVHT P S S S S GE GGgr-ZL PQRI YRHRQVAVD LMC 
E-AVLT HNNE S R SNCVF E F HVHT P S S S S GEGGIi L PQRVYRNRQ\rAVDLMf; 



PAVLTHNNE SRSNCVFEFHVHT PS S S SGEGG 6I1PQR6YRNRQVAVDI1NQ 

320 * 34n 



E e||p PQ s^glh'G^^lq pvt e eShmf 

E E ^ P P Q S QGL H g3l Q PVT E E giHM Y 


plPELTVSWE P 

^PELTVSWEpj 


gC4Qg C'lTPIGS PEYS S 
SgqI [.[T PX GS PE Y 3 S 


EE PPQS 6LHG LQPVTEE HNS 
360 * 


PELTVSWEP 

380 


GQ WPIGSPEYSS 

* 400 



DS SQIT S SDPSDPQS &PPTGC4HAAFPGSDVSLPFIHLPQTVIQE SPLFPi 
DS S'31T S SDPSDFQS PPPTGg|aaPFC4SDVSMPFIHLPQTVLQE S PLFPC 



DS SQIT S SDPSDFQS PPPTGG AAPEG&DVS 6PFIHLPQTV6QE S PLFFC 

* 420 •* . 



F PQGTT sSi^glLSASF 3 SGGSA.LH PQVIGP^LPQ 
F PQGTT sBgVLTA.SF S SGGSA.LH PQVI GKl. PQ 



is : 433 

: 434 

FPQGTTS Q6L3ASPSSGGSAI.HPQV1GKLPQ ■ 
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SEQUENCE LISTING 



<110> Xenon Genetics Inc. 



<120> HSAN II Related Gene and Expression Products 
and Uses Thereof 



<130> 760050-110 



<140> 
<141> 

<150> US/60/425, 601 

<151> 2002-11-12 



<150> US/60/502,453 
<151> 2003-09-12 



<160> 33 



<170> Patentin version 3,0 



<210> 1 

<211> 2786 

<212> DNA 

<213> Homo sapiens 



<400> 1 

cgtggtggcg 

aacccaggag 

agaggaagac 

ttatgatcag 

aatataataa 

ctagtcaaat 

ggcatactat 

aagaatatag 

actagtcggg 

gatgtaattt 

ttacttacaa 

tcttatctaa 

tttttactat 

tgtaaccatt 

catatgaatg 

tccgtgtggg 

acgtccagtt 

gcgtaagagc 

tgttggccaa 

agttatgttg 

tcctatgttt 

agaagctcac 

ccgggtggca 

gaaacagata 

atttccatct 

attgggtccg 

tttgaaattt 

tgagagcaga 

agaaggaggt 

gaatcaagaa 

tgaagaaaag 

acagaactgg 



cacgcgtgta 
gcggaggttg 
tccatctcaa 
aaaaatggac 
aactattaat 
tatatatctt 
taattgataa 
ggaaagttag 
cttttatgcc 
acattaatgc 
agtttaattc 
agcaaagagc 
gcacataatt 
tcatagggtt 
tatgaattac 
gggaccccaa 
tctttttcac 
acctccttcc 
agtcttcttc 
ggtactacag 
gaaccatctc 
tattttattc 
gaacagtatg 
cctgaacaga 
ggacaggctt 
ggatctcccc 
caccctgtct 
agcaactgtg 
ggaattttac 
gaactgcctc 
cataattacc 
ccaataggaa 



gtcccagcta 
cagtaagcca 
aaaaaagaaa 
attttgtgat 
catctagctt 
ttactatcca 
gaaaataaaa 
ttcaatagtt 
ttgttttagt 
ttaagagtga 
ttccatttcc 
ccacagattg 
tccagaagca 
acacagaact 
ttgtcttatt 
catacccaga 
cacctcccac 
tggaagccca 
cacctggtgg 
ccagtagagt 
aagtttacag 
ctcaggaagc 
agggcattcc 
agccagtaca 
tcctggtagg 
tctctagtat 
ttgttcctca 
tatttgaatt 
ctcagcgtgt 
ctcaatcagt 
atgccccaga 
gcccagaata 



cttgggaggc 
agattgtcac 
aaagaaattt 
ttaactctgt 
gggagagata 
ccaaaaatct 
ctgaaggcct 
taaattaaag 
tcttactctt 
actttttaag 
tttgagagag 
atttcattgg 
ttgttattta 
acccagttgfc 
catgttgata 
atcacagata 
ctgcccaccg 
aactcaccac 
cagcccaact 
aactggagag 
tgactataga 
agtgtatgta 
atacaactca 
agggggccct 
acaccttcag 
ttctgcacct 
ttctgcgcct 
tcatgttcac 
ttaccgaaat 
tggattacat 
attgaccgtt 
ttccagtgat 



tgaggcagaa 
tgcactccag 
catggttatg 
aacatgtttc 
ggagaaagac 
cttctgattt 
ctaacatatc 
cacacttctt 
cctttaactc 
tgtgggtaaa 
gaaagttatg 
ccctggatgt 
tttattaatt 
gcatgtctga 
cagcctcagt 
tttttcccaa 
aaagtagcca 
ttccaacccc 
aactggacac 
tcatgtgaga 
cctggactag 
gctggggtac 
tcagtactgt 
acttcaagtt 
aatctaagat 
atcagtacag 
gctgtgttaa 
acaccaagct 
cggcaggttg 
ggctacttgc 
tctgtggtag 
tcctcacaaa 



gaatcccttg 

cctgggcgaa 

caactcttat 

atgtagtaaa 

attactgtca 

ctggttagaa- 

acagggtaat 

acagtataga 

tttttctgtt 

aacgaaataa 

gaaaagcagc 

atttaatgga 

ataaatttag 

tgtaatttca 

ccatggcgca 

ctattcatga 

tttcccagcg 

tgctgaggac 

cagaggccgt 

tacaggtcca 

tacttccaga 

attaccaggc 

caagtcctat 

ctgtctttga 

tagattctgg 

atgctacacg 

ctcataacaa 

cctcttcagg 

cagtggactt 

agcctgtgac 

agcctatcgg 

tcacttcttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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agaccccagt gattttc^^^acctccccc 

tgacgtctca atgcccttta tccatctgcc 
cttctgtttc ccccaaggaa ccacatctca 
aggatctgca cttcatccac aggttatagg 
actttgcacc ataacattta aattttctat 
gaaatattgt ttaattttat cagtagagtt 
atatgcattt aaaaacaaaa tgtgagagaa 
ccagtggaaa aaacacaaaa actagaattt 
cacgttattt tcaaccagta gtgaaaatgc 
ttatttagga atacaaatct taagtattac 
aggataaatt ttttttctat aaaatataaa 
gtaatattaa taatacacag aattttcatt 
atcatgggaa taaaaaaaga tagaaaagga 
aaatgccttt ttaaaatggc atcagtcaag 
taaaatgcta attcataaaa ttaagg 



tacaggggga 
tcagacagtg 
gcaggtctta 
aaaacttcca 
tccttatttc 
tccccatctt 
gctacctgat 
tagtcattct 
aagtgtatgt 
tttcttcctc 
acatggaaaa 
agtgtcgaag 
aacagtttag 
caagttgctg 



gcagctgcac 
ttacaagaat 
actgcctcat 
caattatttt 
cctgaatcat 
tggggggttg 
ttacctatta 
tcacaatcac 
aatgtatggt 
caaacaagtt 
tagggaatgc 
gatctaaaaa 
gaatttgcct 
tgcattatta 
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ggctc 1980 



ccccactttt 
tttcttcagg 
aaactaccct 
ggattttgga 
tgaactacat 
tatgtgaaaa 
gacttctatg 
tgacccagca 
tttaa&aaat 
tgtttttgag 
gacaaagtat 
taacaaatga 
tatgtccaaa 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2786 



<210> 2 

<211> 434 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu He Gin Pro Gin Ser Met 
1 5 10 ■ 15 

Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin He Phe 
20 25 30 

Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
35 ,40 45 

Cys Pro Pro Lys Val Ala He Ser Gin Arg Arg Lys Ser Thr Ser Phe 
50 55 60 

Leu Glu Ala Gin Thr His His Phe Gin Pro Leu Leu Arg Thr Val Gly 
65 70 75 80 

Gin Ser Leu Leu Pro Pro Gly Gly Ser Pro Thr Asn Trp Thr Pro Glu 
85 90 95 

Ala Val Val Met Leu Gly Thr Thr Ala Ser Arg Val Thr Gly Glu Ser 
100 105 110 

Cys Glu He Gin Val His Pro Met Phe Glu Pro Ser Gin Val Tyr Ser 
115 120 125 

Asp Tyr Arg Pro Gly Leu Val Leu Pro Glu Glu Ala His Tyr Phe He 
130 135 140 

Pro Gin Glu Ala Val Tyr Val Ala Gly Val His Tyr Gin Ala Arg Val 
145 150 155 160 

Ala Glu Gin Tyr Glu Gly He Pro Tyr Asn Ser Ser Val Leu Ser Ser 
165 170 175 

Pro Met Lys Gin He Pro Glu Gin Lys Pro Val Gin Gly Gly Pro Thr 
180 185 190 

Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val Gly 
195 200 205 
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His Leu Gin Asn lieu Arg Leu Asp Ser Gly Leu Gly Pro Gly Ser Pro 
210 215 220 

Leu Ser Ser lie Ser Ala Pro lie Ser Thr Asp Ala Thr Arg Leu Lys 
225 230 235 240 

Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr His 
245 250 - 255 

Asn Asn Glu Ser Arg Ser Asn Cys Val Phe Glu Phe His Val His Thr 
260 265 270 

Pro Ser Ser Ser Ser Gly Glu Gly Gly Gly lie Leu Pro Gin Arg Val 
275 280 285 

Tyr Arg Asn Arg Gin Val Ala Val Asp Leu Asn Gin Glu Glu Leu Pro 
290 295 300 

Pro Gin Ser Val Gly Leu His Gly Tyr Leu Gin Pro Val Thr Glu Glu 
305 310 315 320 

Lys His Asn Tyr His Ala Pro Glu Leu Thr Val Ser Val Val Glu Pro 
325 330 335 

lie Gly Gin Asn Trp Pro lie Gly Ser Pro Glu Tyr Ser Ser Asp Ser 
340 345 350 

Ser Gin lie Thr Ser Ser Asp Pro Ser Asp Phe Gin Ser Pro Pro Pro 
355 360 365 

Thr Gly Gly Ala Ala Ala Pro Phe Gly Ser Asp Val Ser Met Pro Phe 
370 375 380 

lie His Leu Pro Gin Thr Val Leu Gin Glu Ser Pro Leu Phe Phe Cys 
385 390 395 400 

Phe Pro Gin Gly Thr Thr Ser Gin Gin Val Leu Thr Ala Ser Phe Ser 
405 410 415 

Ser Gly Gly Ser Ala Leu His Pro Gin Val lie Gly Lys Leu Pro Gin 
420 425 430 

Leu Phe 



<210> 3 

<211> 206 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu lie Gin Pro Gin Ser Met 
1 5 10 . 15 

Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin He Phe 
20 25 30 

Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
35 40 45 



3 



wo 2004/043999 ^Bt/CA2003/001716 

Cys Pro Pro Lys VaH^a lie Ser Gin Arg Arg Lys Ser Thr aBF Phe 

50 55 .60 

Leu Glu Ala Gin Thr His His Phe Gin Pro Leu Leu Arg Thr Val Gly 
65 70 75 80 

Gin Ser Leu Leu Pro Pro Gly Gly Ser Pro Thr Asn Trp Thr Pro Glu 
85 90 95 

Ala Val Val Met Leu Gly Thr Thr Ala Ser Arg Val Thr Gly Glu Ser 
100 105 110 

Cys Glu lie Gin Val His Pro Met Phe Glu Pro Ser Gin Val Tyr Ser 
115 120 125 

Asp Tyr Arg Pro Gly Leu Val Leu Pro Glu Glu Ala His Tyr Phe He 
130 135 140 

Pro Gin Glu Ala Val Tyr Val Ala Gly Val His Tyr Gin Ala Arg Val 
145 150 155 160 

Ala Glu Gin Tyr Glu Gly He Pro Tyr Asn Ser Ser Val Leu Ser Ser 
165 170 175 

Pro Met Lys Gin He Pro Glu Gin Lys Pro Val Gin Gly Gly Pro Thr 
180 185 190 

Ser Ser Ser Val Phe Asp Phe His Leu Asp Arg Leu Ser Trp 
195 200 205 

<210> 4 

<211> 318 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu He Gin Pro Gin Ser Met 
15 10 15 

Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin He Phe 
20 25 30 

Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
35 40 45 

Cys Pro Pro Lys Val Ala He Ser Gin Arg Arg Lys Ser Thr Ser Phe 
50 55 60 

Leu Glu Ala Gin Thr His His Phe Gin Pro Leu Leu Arg Thr Val Gly 
65 70 75 80 

Gin Ser Leu Leu Pro Pro Gly Gly Ser Pro Thr Asn Trp Thr Pro Glu 
85 90 95 

Ala Val Val Met Leu Gly Thr Thr Ala Ser Arg Val Thr Gly Glu Ser 
100 105 110 

Cys Glu He Gin Val His Pro Met Phe Glu Pro Ser Gin Val Tyr Ser 
115 120 125 



4 
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Asp Tyr Arg Pro GlJ^^i Val Leu Pro Glu Glu Ala His Tyr IBr lie. 
130 135 140 

Pro Gin Glu Ala Val Tyr Val Ala Gly Val His Tyr Gin Ala Arg Val 
145 150 155 160 

Ala Glu Gin Tyr Glu Gly lie Pro Tyr Asn Ser Ser Val Leu Ser Ser 
165 170 175 

Pro Met Lys Gin lie Pro Glu Gin Lys Pro Val Gin Gly Gly Pro Thr 
180 185 190 

Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val Gly 
195 200 205 

His Leu Gin Asn Leu Arg Leu Asp Ser Gly Leu Gly Pro Gly Ser Pro 
210 215 220 

Leu Ser Ser lie Ser Ala Pro lie Ser Thr Asp Ala Thr Arg Leu Lys 
225 230 235 240 

Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr His 
245 250 255 

Asn Asn Glu Ser Arg Ser Asn Cys Val Phe Glu Phe His Val His Thr 
260 265 270 

Pro Ser Ser Ser Ser Gly Glu Gly Gly Gly lie Leu Pro Gin Arg Val 
275 280 285 

Tyr Arg Asn Arg Gin Val Ala Val Asp Leu Asn Gin Glu Glu Leu Pro 
290 295 300 

Pro Gin lie Ser Trp lie Thr Trp Leu Leu Ala Ala Cys Asp 
305 310 315 

<210> 5 

<211> 314 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu lie Gin Pro Gin Ser Met 
1 ' 5 10 15 

Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin He Phe 
20 25 30 

Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
• 35 40 45 

Cys Pro Pro Lys Val Ala He Ser Gin Arg Arg Lys Ser Thr Ser Phe 
50 55 60 

Leu Glu Ala Gin Thr His His Phe Gin Pro Leu Leu Arg Thr Val Gly 
65 70 75 80 

Gin Ser Leu Leu Pro Pro Gly Gly Ser Pro Thr Asn Trp Thr Pro Glu 
85 90 95 



5 
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Ala Vai Val Met Le^^y Thr Thr Ala Ser Arg Val Thr Gly mm Ser 

100 105 110 



Cys Glu He Gin Val His Pro Met Phe Glu Pro Ser Gin Val Tyr Ser 
115 120 125 

Asp Tyr Arg Pro Gly Leu Val Leu Pro Glu Glu Ala His Tyr Phe He 
130 135 140 

Pro Gin Glu Ala Val Tyr Val Ala Gly Val His Tyr Gin Ala Arg Val 
145 150 155 160 

Ala Glu Gin Tyr Glu Gly He Pro Tyr Asn Ser Ser Val Leu Ser Ser 
165 170 175 

Pro Met Lys Gin He Pro Glu Gin Lys Pro Val Gin Gly Gly Pro Thr 
180 185 190 

Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val Gly 
195 200 205 

His Leu Gin Asn Leu Arg Leu Asp Ser Gly Leu Gly Pro Gly Ser Pro 
210 215 220 

Leu Ser Ser He Ser Ala Pro He Ser Thr Asp Ala Thr Arg Leu Lys 
225 230 235 240 

Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr His 
245 250 255 

Asn Asn Glu Ser Arg Ser Asn Cys Val Phe Glu Phe His Val His Thr 
260 265 270 

Pro Ser Ser Ser Ser Gly Glu Gly Gly Gly He Leu Pro Gin Arg Val 
275 280 285 

Tyr Arg Asn Arg Gin Val Ala Val Asp Leu Asn Gin Glu Glu Leu Pro 
290 295 300 

Pro Gin Ser Val Gly Leu His Gly Tyr Leu 
305 310 



<210> 6 

<211> 5562 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gtaagtaaat gcttaagagc atgtaatact acaatgagag ccaatggata gtctgctaaa 60 

ttaaaattct ttgtacaaac caagtaacag tatgagtctg aagtagaaaa tataaaatgc 120 

tgtgtctcta aagccttatt atagagtagg atatagacat taatgtgtgg gcattagctg 180 

aactgcctac atgaagcagc tatgtgcaag atttagatta gtctgcattt ttactaaagg 240 

gaacgctctc tttagcactt aagatgttga aactattcac agcaacaaaa gctggacata 300 

caaatgtttt gcgggtgggg cgagaagagt ttttcaaaag cttgttaaaa tgattactaa 360 

gtatttcctg taagattccc tattgtacaa aacacaggta agaccttcaa accattatag 420 

aatgctgcat gtgagtactc tagattgagg gaatccttta tttgaatgac tgtttaattt 4 80 

agcgcttgtg ttgagttaat catcagttgc tgtgtgtgtt cctgcctttt aaagtctgtg 540 

ttggacacaa gttccttccc cagtatatgt tcttttgctt tttcccttta gttggtaact 600 

gttttattac tgatttattt tcagaatcaa cttatagaaa ttatttcctg aaacatttta 660 

tgcttgtata cacatacaca cacacaatta tcctgtgttt tacattagac actcttgttt 720 



6 
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ctcatataga gaatgtt^BUFgtagatcatc 

agaggaaaat aaaatacgtc atctgcctct 
ctgtacagca cttgtttagt gagatcttta 
ttttaatcta aactttgtgg aattcttgct 
gaaaatactt ctgcttgttc gattttttat 
gtttcagagt ccgtttaatc ctgctttggc 
caaataaaaa actaatatac taacttttta 
tgagggtaga atagaagaac tttataaaga 
caaatattac ttgtcagcat ttcttgtcag 
tattataact agattttgct aagacttttt 
atgggggagc attaggtctc tattttagaa 
gagccattaa tctagctata tttattgatg 
gacagtattc tatggaggta ttatgaagaa 
ctaattctat tcttgtctca tttcaggaat 
gtgaaatttt agaaattaga ataaaagaaa 
aaactaaaca atcatatgat gatatgtaaa 
ccaaagcatt ctacacagcc aagttttgga 
tctaaaagga aaataacagt cttattcaag 
tatagccata ttaaaccctg tgcctgaaca 
agcgtctcac tctgtctccc aggtgggaat 
ttcacctcct gggttcaagt gattctcctg 
gcacccatca ccacgcccag ctaatttttg 
gtgttggcta ggctggtctt gaactcttga 
aaagtgccag gattacaggt gtgagccacc 
cagattgaag atattctccc attgattact 
agttgtgaat ttcttctaga aagctaaact 
ttacaaataa agagtggcaa gtataagaaa 
tcttctaaaa tagaaatgag aatctttttc 
caatcttaaa attctctgta tcttttaatg 
gaatcttaag tttgccaaag ttgcaagcta 
cagtcagttg gtcattctta aattcataca 
ttagcatgat aactgatcat tatcatgatc 
cagaatagtt gtgaatagtt gtgatctgtc 
caaaaaaaaa actgactttg tggaattggg 
gccctgttat cattgtactt ttgtttattt 
gtgtttgttt tgtgttgagc ctcgtcgtgg 
tctgctgttg cctctgtgtc ccgcatctct 
tgtctgcacc tctttctcct tccctcctcc 
gctttctaaa gcattggaga gtgtcctgcc 
acgctccagc ctgccttccc tctttgtcag 
tgaacaatta ttaccaatga atacatccag 
agaacaggac taaacttggt tatattatgc 
cttttttgta ataaggcttt aagggggaat 
ct'ataatgtc cttggcacat accagcatgt 
tcctgtttcc tatcgacata aaattctttc 
ttttaaagtt attgccttaa tcaaaagcaa 
ctcctcactt tattctaagg agacactaat 
aaccttagtc agcaaagaat aaccttggag 
taagatgatg tatcagcaaa ccaaagaact 
ccttaggttt caacctaagg tgggactgta 
gtatgtatga ctttctttgg ctccattaat 
tttaagcatg tgggtccttg ctgcttttcc 
agcaacccca gaattttcag atttttactt 
tggctttgag acataccatg agtaaataac 
aatccttatt tagctattgt gaacagttga 
ctatggaaga atgaaactga attcttttaa 
tactattatt tacttattta tttatgtttt 
ggagtgcagt ggcacgatct cgactcacgg 
tcctgcctca gcctcccgag tagctaggac 
ttttgtattt tagtagagat gggtttcacc 
acattgtgat ccaccccctt cagcctccca 



agataatacc 
gaaaggtatt 
agcagacaga 
cttctttaaa 
cctacagctt 
taggtctcaa 
aaacaatcta 
ctgcacagaa 
tttctacctt 
ttttaaaatt 
agcaagtaga 
ctctaaaagc 
ttattaaata 
gtggcttaca 
taaggctaga 
tacttggtat 
aatgaaaaca 
acaaccttta 
tttttttaag 
acagtggcgt 
cctcagcctc 
tatttttttt 
cctcaagtga 
atgcccggcc 
ttgtaaaagt 
ttttcctagt 
tactgtgagc 
tgaggtttcc 
tactttaaac 
atattcattg 
cttgtaacac 
attcattaaa 
agatgtgcta 
agaggatgga 
tgttttcaca 
ccgtagcatg 
cccagtgctc 
ggactgcccc 
tatgcactct 
tactgtatgt 
gcaacaagaa 
ttctaggata 
aaaaagcaag 
gtccaataaa 
attttcatat 
attttatcaa 
aaaactggat 
tgaacattta 
acataatgtt 
gctctaatga 
gtttataaaa 
aaataaagaa 
aggaga^tga 
aataaaatga 
tctactttgc 
aaagcggggg 
ggagatggaa 
caacctctgc 
tacaggcgca 
atgtttggcc 
cagtgctggg 



ccatttcttt 
agaaaataag 
ttttcccctt 
tgaaaatctt 
ggaataaaag 
aatgaatggc 
ctttctgtct 
aatatatttc 
taattgtttt 
ttctcttgtc 
atgacatgaa 
agacgttctt 
ttgagaattt 
ataatatgga 
gagaaatcaa 
tggtgggcca 
tagtttcata 
tggtattgat 
tttttttttt 
gatcttggct 
cccagtagct 
agtagagaca 
tccacctgcc 
taaattatta 
acaaaatatt 
agatcatgac 
caacaacttt 
agagctcttg 
taatagaatt 
ttttccctgt 
ttggtgttat 
cctgtccttg 
tgttgagtta 
acatttcttc 
gtactgtgtt 
tcggtttgtg 
ttccacccca 
gaggaaactt 
gcctctcagc 
aactgtaaac 
ttttaattaa 
ccagttattc 
gaggtaatga 
atttaccttt 
tctagtttta 
acattaccca 
gagtcctttt 
ttttaattta 
cccccatgga 
atgcattaga 
gatctatctt 
tatagttaac 
agaactatac 
gcattctgac 
cagctcgata 
aggctgtgat 
tctcgctctg 
ctcctgggtt 
tgccaccaca 
aggatggtct 
attacaggcg 



tgtt< 



T/CA2003/001716 



tgttgacctc 
taacttgtca 
ctcagaaagc 
tctctgttaa 
atgttcattt 
cacagtaaca 
ttaaaataat 
tgtgttctga 
ttttctggat 
tatactcatc 
gatattattt 
gctttttaat 
aaaaatttgt 
caaattataa 
aacaggaatg 
cagatttggc 
gttcacaaca 
catactgcct 
tttttttgac 
cactgcaact 
aagattacag 
cagtttcacc 
tcggcctccc 
tttagagatt 
tcctatatcc 
ttttttacct 
ccatctcatg 
caaaataatc 
cattcttgtt 
ctgcaccctt 
ttcagaatat 
actgacaacg 
tagctctcct 
atgcaactat 
tttcatgtgt 
ttcccatctt 
ccgccagtac 
ttgccgaaaa 
gcaagcaccg 
ttctgacaaa 
aattgaataa 
ctgtaggaaa 
gagagaatac 
cattatatct 
aaaggattaa 
aacatttttt 
tttttttaat 
atcatttatg 
agtatttttt 
tgcatatata 
ataaaattat 
tttctgaact 
agtgttatta 
tgaaaataga 
aggatcttcc 
agtgaattct 
tcgccaggct 
caagtgattc 
cccagctaat 
tgatctcctg 
tgagccaccg 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
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cgcccagcgg gtaatgai^j^ttatagtgt atattgtttg 
taaatttagt cttaaatcta gaactgcttc tagaataaaa 
gctgtcattt atattaccct tttcagttga ttagaacacc 
ttcatggtta cggccgggct cagtggctca cgcctgtaat 
aagtgagcag atcacctgag gtcaggagtt cgagaccagc 
tgtctctact aaaaatacaa aaattagctg ggcgtggtgg 
tacttgggag gctgaggcag aagaatccct tgaacccagg 
caagattgtc actgcactcc agcctgggcg aaagaggaag 
aaaaagaaat ttcatggtta tgcaactctt atttatgatc 
atttaactct gtaacatgtt tcatgtagta aaaatataat 
ttgggagaga taggagaaag acattactgt cactagtcaa 
caccaaaaat ctcttctgat ttctggttag aaggcatact 
aactgaaggc ctctaacata tcacagggta ataagaatat 
tttaaattaa agcacacttc ttacagtata gaactagtcg 
gttcttactc ttcctttaac tctttttctg ttgatgtaat 
gaacttttta agtgtgggta aaaacgaaat aattacttac 
cctttgagag aggaaagtta tggaaaagca gctcttatct 
tgatttcatt ggccctggat gtatttaatg gatttttact 
cattgttatt tatttattaa ttataaattt agtgtaacca 
ctacccagtt gtgcatgtct gatgtaattt cacatatgaa 



catttggcaa 
agtgctatca 
aggatgtcag 
cctagcactt 
ctggccaaca 
cgcacgcgtg 
aggbggaggt 
actccatctc 
agaaaaatgg 
aaaactatta 
attatatatc 
attaattgat 
agggaaagtt 
ggcttttatg 
ttacattaat 
aaagtttaat 
aaagcaaaga 
atgcacataa 
tttcataggg 
tg 



It' 

g^^g 



yCA2003/001716 
gtttc 4440 



aataaaattg 
aatcagaaaa 
tgggaggccg 
tggcgaaatg 
tagtcccagc 
tgcagtaagc 
aaaaaaaaga 
acattttgtg 
atcatctagc 
ttttactatc 
aagaaaataa 
agttcaatdg 
ccttgtttta 
gcttaagagt 
tcttccattt 
gcccacagat 
tttccagaag 
ttacacagaa 



4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5562 



<210> 7 

<211> 1755 

<212> DNA 

<213> Mus musculus 



<400> 7 

tttgctttga 

cattcaagga 

ctataaattc 

acacagaact 

cttgtcttat 

acatacccag 

ccacctccca 

ctggaagccc 

ccacctggta 

gccaatagag 

cagatttaca 

cctcaggaaa 

gagggcattt 

aagccagtgc 

ttcctggtag 

ct'ctctagta 

tttgtacctc 

gtatttgaat 

caacgtgttt 

cagtcagttg 

gcccccgaat 

ccagaatatt 

cctcccccta 

cgcctgcctc 

acatctcagc 

gttataggaa 

ttctctatct 

ggatttcttt 

ctggtttgcc 

tagtaatgaa 



cagagcaagt 
agtgccacag 
caaaagcatt 
atccagctgt 
tcatgttgat 
aatcacagat 
cctgtccacc 
aaactcgcca 
gcagcccaac 
taaatagaga 
gtgactatag 
cagtatatct 
catacaactc 

cggggggccc 

gacaccttca 
tttctgcgcc 
attctgcacc 
ttcatgctca 
accgaaatcg 
gattacattg 
tgaccatttc 
ccagtgattc 
caggggggac 
agacagtgtt 
aggtcttatc 
aactttcaca 
catttacctg 
tgtggttata 
tgtgatttta 
aatgc 



ttgtgggaaa 

attaacttta 

gttatttatt 

gcatgtctga 

acagcctcag 

atttttccca 

gaaagtagcc 

cttccaaccc 

taactggaca 

gctatgtgag 

acctggacta 

agctggggtg 

accagtactg 

tgcctcaagt 

gaatttaaga 

taacagtaca 

agctgtgtta 

aacaccaagt 

gcaggttgca 

ccacctgcag 

tgtggtagaa 

ctctcaaatt 

agctgcacct 

acaagaatcc 

tgcttcatat 

attcttttaa 

aatcatggat 

aatacttgta 

attgtcagaa 



gcaactgggg 

ttggccctgg 

aaaattttcg 

tgtaatttta 

tccatggcgc 

actattcatg 

atttcccaac 

ctgctgagga 

ccagaggcca 

atgcaggtcc 

gtactggcag 

cattaccagg 

tcaagtccta 

tctgtctttg 

ttagattctg 

gatgctacac 

actaatagca 

tcttcaggag 

gtggactcaa 

cctgttactg 

cctatgggac 

acttcttcag 

tttggctctg 

ccactattct 

tcttcaggag 

actaccctat 

ttataaggaa 

gtaaaaaatg 

tcaatgacta 



tggttttttt 

atgtatttaa 

tgtggccact 

catattgaat 

atccgtgtgg 

aacgtccagt 

gacgtaagag 

ctgttggcca 

tagttatgtt 

aacctgtgtt 

aagaagctca 

cccaggtggc 

tgaaacagat 

aatttccttc 

gaccaagtcc 

atctgaaatt 

atgagaacag 

aaggaggtgg 

atcaggaaga 

aagaacagcg 

aaatctggcc 

atctcagtga 

acgtctcatt 

tctgtttccc 

gatctacact 

tatacaccag 

tgttgtttat 

aactataaag 

atctctttgg 



tttttttttt 
tggatttttg 
tcatggggtt 
gtatgaatta 

ggggacccca 

tt'ctttttca 
cacctccttc 
aaatcatctt 
gggtgctaca 
tgagccaacc 
ctattttatt 
aggacagtat 
atctgaacag 
tggacaggct 
agcatcaccc 
tcaccctgtc 
aagcaactgt 
gattttacct 
actgtctcct 
taataaccat 
aataggaagc 
ttttcaatca 
gccctttatt 
ccaaggaacc 
ccatccacag 
tgcatttaaa 
tttcctagta 
cagaaaatac 
aattttgtgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1755 



<210> 8 
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<211> 433 . 

<212> PRT 

<213> Mus musculus 

<400> 8 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu lie Gin Pro Gin Ser Met 
1 5 .10 15 

Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu. Ser Gin lie Phe 
20 25- 30 

Phe Pro Thr lie His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
35 40 45 

Cys Pro Pro Lys Val Ala lie Ser Gin Arg Arg Lys Ser Thr Ser Phe 
50 55 60 

Leu Glu Ala Gin Thr Arg His Phe Gin Pro Leu Leu Arg Thr Val Gly 
65 70 75 80 

Gin Asn His Leu Pro Pro Gly Ser Ser Pro Thr Asn Trp Thr Pro Glu 
85 90 95 

Ala lie Val Met Leu Gly Ala Thr Ala Asn Arg Val Asn Arg Glu Leu 
100 105 110 

Cys Glu Met Gin Val Gin Pro Val Phe Glu Pro Thr Gin He Tyr Ser 
115 120 125 

Asp Tyr Arg Pro Gly Leu Val Leu Ala Glu Glu Ala His Tyr Phe He 
130 135 140 

Pro Gin Glu Thr Val Tyr Leu Ala Gly Val His Tyr Gin Ala Gin Val 
145 150 155 160 

Ala Gly Gin Tyr Glu Gly He Ser Tyr Asn Ser Pro Val Leu Ser Ser 
165 170 175 

Pro Met Lys Gin He Ser Glu Gin Lys Pro Val Pro' Gly Gly Pro Ala 
180 185 190 

Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val Gly 
195 200 205 

His Leu Gin Asn Leu Arg Leu Asp Ser Gly Pro Ser Pro Ala Ser Pro 
210 215 220 

Leu Ser Ser He Ser Ala Pro Asn Ser Thr Asp Ala Thr His Leu Lys 
225 230 235 240 

Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr Asn 
245 250 255 

Ser Asn Glu Asn Arg Ser Asn Cys Val Phe Glu Phe His Ala Gin Thr 
260 265 270 

Pro Ser Ser Ser Gly Glu Gly Gly Gly He Leu Pro Gin Arg Val Tyr 
.275 280 285. 

Arg Asn Arg Gin Val Ala Val Asp Ser Asn Gin Glu Glu Leu Ser Pro 
290 295 300 
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Gin Ser Val Gly Leu His Cys His Leu Gin Pro Val Thr Glu Glu Gin 
305 310 315 - 320 

Arg Asn Asn His Ala Pro Glu Leu Thr lie Ser Val Val Glu Pro Met 
325 330 335 

Gly Gin lie Trp Pro lie Gly Ser Pro Glu Tyr Ser Ser Asp Ser Ser 
340 345 350 

Gin lie Thr Ser Ser Asp Leu Ser Asp Phe Gin Ser Pro Pro Pro Thr 
355 360 365 

Gly Gly Thr Ala Ala Pro Phe Gly Ser Asp Val Ser Leu Pro Phe He 
370 375 380 

Arg Leu Pro Gin Thr Val Leu Gin Glu Ser Pro Leu Phe Phe Cys Phe 
385 390 395 400 

Pro Gin Gly Thr Thr Ser Gin Gin Val Leu Ser Ala Ser Tyr Ser Ser 
405 410 415 

Gly Gly Ser Thr Leu His Pro Gin Val He Gly Lys Leu Ser Gin Phe 
420 425 430 

Phe 



<210> 9 

<211> 1716 

<212> DNA 

<213> Rattus rattus 



<400> 9 

gcagcttttt ctttcattca aggaagtgcc acagattgac tttattggcc ctggatgtat 60 

ttaatggatt tttgctataa attccaaaag cattgttagt tattaaaatt ttggtgtagc 120 

cacttcatgg ggttacacag aactatccag ctgtgcatgt ctgatgtaat tttacatatt . 180 

gaatgtatga attacttgtc ttattcatgt tgatacagcc tcagtccgtg gcgcatccgt 240 

gtggggggac cccaacatac ccagaatcac agatattttt cccaactatt catgaacgtc 300 

cagtttcttt ttcaccacct cctacctgtc caccgaaagt agccatttcc cagcgacgta 360 

agagtacctc cttcctggaa gctcaaactc gccacttcca acccctgctg aggactgttg 420 

gccaaaatca tcttccacct ggtggcagcc caactaactg gacaccagag gccatagtta 480 

tgttgggtac tacagccaat agagtaaata gagagctatg tgagatgcag gtccaacctg 540 

tgtttgagac aacccagatt tacagtgact atagacctgg actagtactg gcagaagaag 600 

ctcgctattt tatcctcagg gaaacagtat atctaggtgg ggtgcattac catccccatg 660 

cggcaggaca gtatgagggt atttcataca actcaccagt actgtcaagt cctatgaaac 720 

agataactga acagaagcca gtgcctgggt gccctgcctc aagttctgtc tttgaatttc 780 

cttctggaca ggctttccta gtaggacacc ttcagaattt aagattagat tctggaccaa 840 

gtccagcatc acccctgtct agtatttctg cgcctaacag tacagatgct acacatttga 900 

aatttcaccc tgtctttgta cctcattctg caccagctgt gttaactcat agcaatgaga 960 

acagaagcaa ctgtgtattt gaatttcatg ctcaaacacc aagttcctct tcaggagaag 1020 

gaggtgggat tttacctcaa cgtgtttacc gaaatcgaca ggttgcagtg gactcaagtc 1080 

aggaagaact gtctcctcag tcagttggat tacactgcca cctgcagcct gttactgaag 1140 

aacagcgtaa taaccatacc ccagaattga ccatttctgt ggtagaacct atgggacaaa 1200 

actggccagt aggaagccca gaatattcca gtgattcctc tcaaattact tcttcagata 1260 

tcagtgattt tcaatcacct ccccctacag ggggaacagc tgcacctttt ggctctgacg 1320 

tctcattgcc ctatattcgc ctgcctcaga cagtgttaca agaatcccca ctattcttct 1380 

gtttccccca aggaaccaca tctcagcagg tcttatctgc ttcatattct tcaggaggat 1440 

ctgcactcca tccacaggtt ataggaaaac tttcacaatt cttttaaaat accctattat 1500 

gcaccaatgc atttaaattc tct.gtctcat ttccctgaat catggattta taagaaatgt 1560 
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tattttctta gtaggatfl^j^tttgtggtt ataaatatat gtactaaaaa aH^acttt 1620 

aaagtggaaa atacctggtt tacctgtgat tttagttgtc agaatcaatg actaatatct 1680 

gtggaatttt gtggtagtag taaaaatgca attgtc 1716 



<210> 10 
<211> 434 
<212> PRT 

<213> Rattus rattus 
<400> 10 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu He Gin Pro Gin Ser Val 
15 10 15 

Rla His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin He Phe 
20 25 30 ' 

Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
35 40 45 

Cys Pro Pro Lys Val Ala He Ser Gin Arg Arg Lys Ser Thr Ser Phe 
50 55 60 

Leu Glu Ala Gin Thr Arg His Phe Gin Pro Leu Leu Arg Thr Val Gly 
65 70 75 80 

Gin Asn His Leu Pro Pro Gly Gly Ser Pro Thr Asn Trp Thr Pro Glu 
85 • 90 95 

Ala He Val Met Leu Gly Thr Thr Ala Asn Arg Val Asn Arg Glu Leu 
100 105 110 

Cys Glu Met Gin Val Gin Pro Val Phe Glu Thr Thr Gin He Tyr Ser 
115 120 125 

Asp Tyr Arg Pro Gly Leu Val Leu Ala Glu Glu Ala Arg Tyr Phe He 
130 135 140 

Leu Arg Glu Thr Val Tyr Leu Gly Gly Val His Tyr His Pro His Ala 
145 150 155 160 

Ala Gly Gin Tyr Glu Gly He Ser Tyr Asn Ser Pro Val Leu Ser Ser 
165 170 175 

Pro Met Lys Gin He Thr Glu Gin Lys Pro Val Pro Gly Cys Pro Ala 
180 185 190 

Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val Gly 
195 200 205 

His Leu Gin Asn Leu Arg Leu Asp Ser Gly Pro Ser Pro Ala Ser Pro 
210 215 220 

Leu Ser Ser He Ser Ala Pro Asn Ser Thr Asp Ala Thr His Leu Lys 
225 230 235 240 

Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr His 
245 250 255 

Ser Asn Glu Asn Arg Ser Asn Cys Val Phe Glu Phe His Ala Gin Thr 
260 265 270 
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Pro Ser Ser Ser Ser Gly Glu Gly Gly Gly lie Leu Pro Gin Arg Val 
275 280 • 285 

Tyr Arg Asn Arg Gin Val Ala Val Asp Ser Ser Gin Glu Glu Leu Ser 

290 295 300 

Pro Gin Ser Val Gly Leu His Cys His Leu Gin Pro Val Thr Glu Glu 
305 310 315 

Gin Arg Asn Asn His Thr Pro Glu Leu Thr He Ser Val Val Glu Pro 
325 330 335 

Met Gly Gin Asn Trp Pro Val Gly Ser Pro Glu Tyr Ser Ser Asp Ser 
340 345 350 

Ser Gin He Thr Ser Ser Asp He Ser Asp Phe Gin Ser Pro Pro Pro 
355 360 365 

Thr Gly Gly Thr Ala Ala Pro Phe Gly Ser Asp Val Ser Leu Pro Tvr 
370 375 380 

He Arg Leu Pro Gin Thr Val Leu Gin Glu Ser Pro Leu Phe Phe Cvs 
385 390 395 400 

Phe Pro Gin Gly Thr Thr Ser Gin Gin Val Leu Ser Ala Ser Tyr Ser 
405 410 415 

Ser Gly Gly Ser Ala Leu His Pro Gin Val He Gly Lys Leu Ser Gin 
420 425 430 

Phe Phe 



<210> 11 

<211> 486 

<212> PRT 

<213> Takifugu rubripes 

<400> 11 

Ser His Phe Glu Cys His Ser He His Cys He Ser Leu Ser Gin Gin 
1 5 10 15 

His Ser Val Pro His Pro Tyr Ser Gly Gly Ala Gly Gly Gly Ala Gly 
20 25 30 

Gly Asn He Pro Leu Ala Ser Leu Pro Asp Pro Thr Thr Val Leu Phe 
35 40 45 

Pro Ser He Pro Glu Arg Pro He Ser Phe Ser Pro Pro Pro Thr Met 
50 55 60 

Pro Pro Lys Ala Tyr Asn Asn Gin Arg Arg Lys Ser Thr Ser He Leu 
65 70 75 80 

Glu Ala His Thr Arg His Phe Gin Pro Ala Tyr Thr Arg Tyr Gly Ser 
85 90 ^ 95 

Asn Leu His Pro Phe Ser Gly Met Glu Thr Val Glu Pro Pro Pro Leu 
100 105 110 
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Phe Met Val Asn Pro Gly Phe Ala Ala Ala Ala Gin Arg Leu Gly Val 
115 120 125 

Gly Glu Pro Leu His Leu Pro Gly Gin Ser Asp Pro Ser Leu Tyr Gly 
130 135 140 

Tyr Lys Asp Met Arg Ala Glu His Gly Glu Ala Met Arg Arg Leu Ser 
145 150 155 160 

Leu Asn Gin Ala Ala Leu Leu Asp His Tyr Glu Ala Met Ala Tyr Gly 
165 170 175 

Gly Tyr Pro Met Thr T^a His Gin Leu Ser Arg Leu Ser Phe His Gin 
180 185 190 

Gin Arg His Ala Ala Ala Ala Ala Ala Ser Phe Ala Phe Glu Gin Pro 

195 200 205 

Gly Phe Leu His Pro His Leu lie Gin Arg Met Ser Ala His Ser Pro 
210 215 220 

Val Pro Pro Pro Leu Ala Pro Leu Ser Ala Pro Ser Gly Thr Ser Met 
225 230 235 240 

Ser Ala Ser Ser Glu Gly Cys Tyr Pro Ser Pro Gin His Ala Ala Val 
245 250 255 

Ser Thr Phe Ala lie Ser Ala ?ro Pro Val Met Pro Asp Ala Ala Pro 
260 265 270 

Pro Thr Gly Asn Val Phe Glu Phe His Leu Ala Ala Ala Gly Asp Pro 
275 280 285 

Asn Leu Met Ala Ser Arg Leu Phe Arg Ala Arg Arg Ser Ser Met Asp 
290 295 300 

Leu Pro Leu Glu Asp Asn Thr Gly Ala Gly Ser Gly Gly Ser Gly Thr 
305 310 315 320 

Tyr Val Arg Leu Gin Pro Val Thr Glu Glu Leu Tyr Ser Tyr Val Ser 
325 330 335 

Pro Glu Leu Pro Leu Pro Pro Gly Ser Leu Phe Leu His His Ala Gly 
340 345 350 

Leu Thr Ala Lys Asp Lys Ser Pro Asp Pro Ser Ser Asp Ser Leu Ala 
355 360 365 

Ser Ser Asp Asn Gly Glu Phe Gin Ser Pro Pro Pro Pro Pro Leu Leu 
370 375 380 

Pro Ser Phe Asp Ser Ser Ser Ala Gin Ser He Pro His Thr Ala Ser 
385 390 395 * • 400 

Thr Ala Met Tyr Asp Pro Ser Thr Gly Met Leu Pro He Asp Pro Gin 
405 410 415 

Gly His Pro Pro Ser Met Gin Ser Phe Leu Pro Pro Thr Pro Ser Ser 
420 425 430 
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Glu Leu Pro Leu Gl^^j/ Ala Pro Ser Thr Gin Leu Glu Thr HWr Met 

435 440 445 

Gin Ser Ala Trp Ala Arg His Gly Gly Val Val Pro Ala Gin Pro Asp 

450 455 460 

Met Thr Tyr His Glu Ser Leu Leu Ala Met Gin Ala Val Asn Pro Ser 
465 470 475 480 

Leu Val Leu Leu Arg Phe 
485 



<210> 12 

<211> 13546 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ccctcaagta tctacaaaga tggtagggaa atgcataatg cagattttac agttcggtta 60 

aggccagtcc tgagagaatt ctttcttcct cctcttctct acttttttta caatgccttt 120 

tttttttttg gcggggggtg gtggtggggg gtgttgtttt ctttcaatat actactgctt 180 

aatttaccct tttattctgt aggtagagtc tgggtatgtc tgtgaaggtg atcacaagac 240 

catggctaaa gctatcaaag acagagtatc attaattaag aggaaacgag agcagcggca 300 

gttggtacgg gaggagcaag aaaaaaaaaa gcaggaagag agcagtctca aacagcaggt 360 

agaacaatcc agtgcttccc agacaggaat caagcagctc ccttctgcta gcaccggcat 420 

acctactgct tctaccactt cagcttcagt ttctacacaa gtagaacctg aagaacctga 480 

ggcagatcaa catcaacaac tacagtacca gcaacccagt atatctgtgt tatgtacgta 540 

tcttgggaag tggacagata ggctaatgat tctcctaatt ggttttttta gcttcttgtt 600 

attttgcact ggctttttgg ttttgaatta tgaatcctac tattatacaa aatttgaaat 660 

ggttgtcgtc tagcttctct ttattcttat atctgtgact aggagaatag aaggatgggg 720 

tggtaaggaa atgtgattag gatacaaaat aattttataa gcttcaacat cttggaggtt 780 

tcaatttgta gatttctact tttcacaatt tatttaaata aaacagattt aataatgaag 840 

aatgaacaag cttggctact aaaattgaac ccatcagtct ggttgagctc ttttagcaaa 900 

gctgaatggt atatagacta aaaactaatg agaacactgg ttattaattt aattacttgt 960 

ttaatactga gaacctcctg gcctccataa ttaagacttt gaaagctgtg tcaaatttaa 1020 

ggcttatagt gtttcatgct gtttccctta ctcctcaggt ggtaagattc tgtgatagcc 1080 

atatggattc ctcatttaaa tatagaaatc tactgaatat ggctacaatt cattttttat 1140 

ttagttcaaa tctgtgcaag gaataactag ttacccctaa tataatgggt ctaaatatga 1200 

gaaaatgtga acctccattt tgtgatttgt ctttctctct ctcttttttt tggcgattca 1260 

tttttccttc agctgatggg acggttgaca gtggtcaggg atcctctgtc ttcacagaat 1320 

ctcgagtgag cagccaacag acagtttcat atggttccca acatgaacag gcacattcta 1380 

caggcacagt cccagggcat ataccttcta ctgtccaagc acagtctcag ccccatgggg 1440 

tatatccacc ctcaagtgtg gtaagtaaat gcttaagagc atgtaatact acaatgagag 1500 

ccaatggata gtctgctaaa ttaaaattct ttgtacaaac caagtaacag tatgagtctg 1560 

aagtagaaaa tataaaatgc tgtgtctcta aagccttatt atagagtagg atatagacat 1620 

taatgtgtgg gcattagctg aactgcctac atgaagcagc tatgtgcaag atttagatta 1680 

gtctgcattt ttactaaagg gaacgctctc tttagcactt aagatgttga aactattcac 1740 

agcaacaaaa gctggacata caaatgtttt gcgggtgggg cgagaagagt ttttcaaaag 1800 

cttgttaaaa tgattactaa gtatttcctg taagattccc tattgtacaa aacacaggta 1860 

agaccttcaa accattatag aatgctgcat gtgagtactc tagattgagg gaatccttta 1920 

tttgaatgac tgtttaattt agcgcttgtg ttgagttaat catcagttgc tgtgtgtgtt 1980 

cctgcctttt aaagtctgtg ttggacacaa gttccttccc cagtatatgt tcttttgctt 2040 

tttcccttta gttggtaact gttttattac tgatttattt tcagaatcaa cttatagaaa 2100 

ttatttcctg aaacatttta tgcttgtata cacatacaca cacacaatta tcctgtgttt 2160 

tacattagac actcttgttt ctcatataga gaatgttaaa gtagatcatc agataatacc 2220 

ccatttcttt tgttgacctc agaggaaaat aaaatacgtc atctgcctct gaaaggtatt 2280 

agaaaataag taacttgtca ctgtacagca cttgtttagt gagatcttta agcagacaga 2340 

ttttcccctt ctcagaaagc ttttaatcta aactttgtgg aattcttgct cttctttaaa 2400 

tgaaaatctt tctctgttaa gaaaatactt ctgcttgttc gattttttat cctacagctt 2460 
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ggaataaaag 
aatgaatggc 
ctttctgtct 
aatatatttc 
taattgtttt 
ttctcttgtc 
atgacatgaa 
agacgttctt 
ttgagaattt 
ataatatgga 
gagaaatcaa 
tggtgggcca 
tagtttcata 
tggtattgat 
tttttttttt 
gatcttggct 
cccagtagct 
agtagagaca 
tccacctgcc 
taaattatta 
acaaaatatt 
agatcatgac 
caacaacttt 
agagctcttg 
taatagaatt 
ttttccctgt 
ttggtgttat 
cctgtccttg 
tgttgagtta 
acatttcttc 
gtactgtgtt 
tcggtttgtg 
ttccacccca 
gaggaaactt 
gcctctcagc 
aactgtaaac 
ttttaattaa 
ccagttattc 
gaggtaatga 
atttaccttt 
tctagtttta 
acattaccca 
gagtcctttt 
ttttaattta 
cccccatgga 
atgcattaga 
gatctatctt 
tatagttaac 
agaactatac 
gcattctgac 
cagctcgata 
aggctgtgat 
tctcgctctg 
ctcctgggtt 
tgccaccaca 
aggatggtct 
attacaggcg 
catttggcaa 
agtgctatca 
aggatgtcag 
cctagcactt 



atgttca! 
cacagtaaca 
ttaaaataat 
tgtgttctga 
ttttctggat 
tatactcatc 
gatattattt 
gctttttaat 
aaaaatttgt 
caaattataa 
aacaggaatg 
cagatttggc 
gttcacaaca 
catactgcct 
tttttttgac 
cactgcaact 
aagattacag 
cagtttcacc 
tcggcctccc 
tttagagatt 
tcctatatcc 
ttttttacct 
ccatctcatg 
caaaataatc 
cattcttgtt 
ctgcaccctt 
ttcagaatat 
actgacaacg 
tagctctcct 
atgcaactat 
tttcatgtgt 
ttcccatctt 
ccgccagtac 
ttgccgaaaa 
gcaagcaccg 
ttctgacaaa 
aattgaataa 
ctgtaggaaa 
gagagaatac 
cattatatct 
aaaggattaa 
aacatttttt 
tttttttaat 
atcatttatg 
agtatttttt 
tgcatatata 
ataaaattat 
tttctgaact 
agtgttatta 
tgaaaataga 
aggatcttcc 
agtgaattct 
tcgccaggct 
caagtgattc 
cccagctaat 
tgatctcctg 
tgagccaccg 
gctgtgtttc 
aataaaattg 
aatcagaaaa 
tgggaggccg 



gtttcagagt 
caaataaaaa 
tgagggtaga 
caaatattac 
tattataact 
atgggggagc 
gagccattaa 
gacagtattc 
ctaattctat 
gtgaaatttt 
aaactaaaca 
ccaaagcatt 
tctaaaagga 
tatagccata 
agcgtctcac 
ttcacctcct 
gcacccatca 
gtgttggcta 
aaagtgccag 
cagattgaag 
agttgtgaat 
ttacaaataa 
tcttctaaaa 
caatcttaaa 
gaatcttaag 
cagtcagttg 
ttagcatgat 
cagaatagtt 
caaaaaaaaa 
gccctgttat 
gtgtttgttt 
tctgctgttg 
tgtctgcacc 
gctttctaaa 
acgctCGagc 
tgaacaatta 
agaacaggac 
cttttttgta 
ctataatgtc 
tcctgtttcc 
ttttaaagtt 
ctcctcactt 
aaccttagtc 
taagatgatg 
ccttaggttt 
gtatgtatga 
tttaagcatg 
agcaacccca 
tggctttgag 
aatccttatt 
ctatggaaga 
tactattatt 
ggagtgcagt 
tcctgcctca 
ttttgtattt 
acattgtgat 
cgcccagcgg 
taaatttagt 
gctgtcattt 
ttcatggtta 
aagtgagcag 



ccgtttaatc 
actaatatac 
atagaagaac 
ttgtcagcat 
agattttgct 
attaggtctc 
tctagctata 
tatggaggta 
tcttgtctca 
agaaattaga 
atcatatgat 
ctacacagcc 
aaataacagt 
ttaaaccctg 
tctgtctccc 
gggttcaagt 
ccacgcccag 
ggctggtctt 
gattacaggt 
atattctccc 
ttcttctaga 
agagtggcaa 
tagaaatgag 
attctctgta 
tttgccaaag 
gtcattctta 
aactgatcat 
gtgaatagtt 
actgactttg 
cattgtactt 
tgtgttgagc 
cctctgtgtc 
tctttctcct 
gcattggaga 
ctgccttccc 
ttaccaatga 
taaacttggt 
ataaggcttt 
cttggcacat 
tatcgacata 
attgccttaa 
tattctaagg 
agcaaagaat 
tatcagcaaa 
caacctaagg 
ctttctttgg 
tgggtccttg 
gaattttcag 
acataccatg 
tagctattgt 
atgaaactga 
tacttattta 
ggcacgatct 
gcctcccgag 
tagtagagat 
ccaccccctt 
gtaatgaatt 
cttaaatcta 
atattaccct 
cggccgggct 
atcacctgag 



ctgctttggc 
taacttttta 
tttataaaga 
ttcttgtcag 
aagacttttt 
tattttagaa 
tttattgatg 
ttatgaagaa 
tttcaggaat 
ataaaagaaa 
gatatgtaaa 
aagttttgga 
cttattcaag 
tgcctgaaca 
aggtgggaat 
gattctcctg 
ctaatttttg 
gaactcttga 
gtgagccapc 
attgattact 
aagctaaact 
gtataagaaa 
aatctttttc 
tcttttaatg 
ttgcaagcta 
aattcataca 
tatcatgatc 
gtgatctgtc 
tggaattggg 
ttgtttattt 
ctcgtcgtgg 
ccgcatctct 
tccctcctcc 
gtgtcetgcc 
tctttgtcag 
atacatccag 
tatattatgc 
aagggggaat 
accagcatgt 
aaattctttc 
tcaaaagcaa 
agacactaat 
aaccttggag 
ccaaagaact 
tgggactgta 
ctccattaat 
ctgcttttcc 
atttttactt 
agtaaataac 
gaacagttga 
attcttttaa 
tttatgtttt 
cgactcacgg 
tagctaggac 
gggtttcacc 
cagcctccca 
attatagtgt 
gaactgcttc 
tttcagttga 
cagtggctca 
gtcaggagtt 



# 
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tctcaa 2520 



aaacaatcta 
ctgcacagaa 
tttctacctt 
ttttaaaatt 
agcaagtaga 
ctctaaaagc 
ttattaaata 
gtggcttaca 
taaggctaga 
tacttggtat 
aatgaaaaca 
acaaccttta 
tttttttaag 
acagtggcgt 
cctcagcctc 
tatttttttt 
cctcaagtga 
atgcccggcc 
ttgtaaaagt 
ttttcctagt 
tactgtgagc 
tgaggtttcc 
tactttaaac 
atattcattg 
cttgtaacac 
attcattaaa 
agatgtgcta 
agaggatgga 
tgttttcaca 
ccgtagcatg 
cccagtgctc 
ggactgcccc 
tatgcactct 
tactgtatgt 
gcaacaagaa 
t-tctaggata 
aaaaagcaag 
gtccaata^a 
attttcatat 
attttatcaa 
aaaactggat 
tgaacattta 
acataatgtt 
gctctaatga 
gtttataaaa 
aaataaagaa 
aggagaatga 
aataaaatga 
tctactttgc 
aaagcggggg 
ggagatggaa 
caacctctgc 
tacaggcgca 
atgtttggcc 
cagtgctggg 
atattgtttg 
tagaataaaa 
ttagaacacc 
cgcctgtaat 
cgagaccagc 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
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attatatatc 
attaattgat 
agggaaagtt 
ggcttttatg 
ttacattaat 
aaagtttaat 
aaagcaaaga 
atgcacataa 
tttcataggg 
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ctggccaaca tggcgaaJ^Mp:gtctctact 

cgcacgcgtg tagtcccagc tacttgggag 
aggcggaggt tgcagtaagc caagattgtc 
actccatctc aaaaaaaaga aaaaagaaat 
agaaaaatgg acattttgtg atttaactct 
aaaactatta atcatctagc ttgggagaga 
ttttactatc caccaaaaat 
aagaaaataa aactgaaggc 
agttcaatag tttaaattaa 
ccttgtttta gttcttactc 
gcttaagagt gaacttttta 
tcttccattt cctttgagag 
gcccacagat tgatttcatt 
tttccagaag cattgttatt 
ttacacagaa ctacccagtt 
tgtatgaatt acttgtctta ttcatgttga 
gggggacccc aacataccca gaatcacaga 
tttctttttc accacctccc acctgcccac 
gcacctcctt cctggaagcc caaactcacc 
aaagtcttct tccacctggt ggcagcccaa 
tgggtactac agccagtaga gtaactggag 
ttgaaccatc tcaagtttac agtgactata 
actattttat tcctcaggaa gcagtgtatg 
cagaacagta tgagggcatt ccatacaact 
tacctgaaca gaagccagta caagggggcc 
ctggacaggc tttcctggta ggacaccttc 
cgggatctcc cctctctagt atttctgcac 
ttcaccctgt ctttgttcct cattctgcgc 
gaagcaactg tgtatttgaa tttcatgttc 
gtggaatttt acctcagcgt gtttaccgaa 
aagaactgcc tcctcaatca gttggattac 
agcataatta ccatgcccca gaattgaccg 
ggccaatagg aagcccagaa tattccagtg 
gtgattttca gtcacctccc cctacagggg 
caatgccctt tatccatctg cctcagacag 
tcccccaagg aaccacatct cagcaggtct 
acaggttata ggaaaacttc 
taaattttct attccttatt 
atqagtagag tttccccatc 
aatgtgagag aagctacctg 
aaactagaat tttagtcatt 
tttcaaccag tagtgaaaat gcaagtgtat 
gaatacaaat cttaagtatt actttcttcc 
ttttttttct ataaaatata aaacatggaa 
aataatacac agaattttca ttagtgtcga 
aataaaaaaa gatagaaaag gaaacagttt 
ttttaaaatg gcatcagtca agcaagttgc 
taattcataa aattaaggac tgattctaat 
ggcttttttt ttttgttata agaatttgtg 
cttaggcctt gtattttcct caacaaattt 
acacacttag caaataaaga caatgtaatg 
gtttactgtt tcatatattc atttggtata 
tacatagaag actatgcagt tatatattga 
tgtgttgttc ttcctacttt cttcctatgt 
caaaatgtgc acaaattatc aattatttag 
gcttgacagc tatacatggg catgcagtat 
attataaata ttaatgtatt gccagtaatt 
caggtagtta atttatagac atactacaga 
agtggaatgt gatattatat ccatgtaagc 
atgacaaaag tggtagcctt ttcatttcgt 
tctcactttc tagtgagaaa cactgtagtt 



cacttcatcc 

ccataacatt 
gtttaatttt 
ttaaaaacaa 
aaaaacacaa 



aaaaatacaa 

gctgaggcag 

actgcactcc 

ttcatggtta 

gtaacatgtt 

taggagaaag 

ctcttctgat 

ctctaacata 

agcacacttc 

ttcctttaac 

agtgtgggta 

aggaaagtta 

ggccctggat 

tatttattaa 

gtgcatgtct 

tacagcctca 

tatttttccc 

cgaaagtagc 

acttccaacc 

ctaactggac 

agtcatgtga 

gacctggact 

tagctggggt 

catcagtact 

ctacttcaag 

agaatctaag 

ctatcagtac 

ctgctgtgtt 

acacaccaag 

atcggcaggt 

atggctactt 

tttctgtggt 

attcctcaca 

gagcagctgc 

tgttacaaga 

taactgcctc 

cacaattatt 

tccctgaatc 

tttggggggt 

atttacctat 

cttcacaatc 

gtaatgtatg 

tccaaacaag 

aatagggaat 

aggatctaaa 

aggaatttgc 

tgtgcattat 

ttttagcaac 

atatgttgca 

aattttagtt 

aatgtaatag 

ttttatcaaa 

agacatttgg 

tcttccctca 

tgctgtacta 

attcacatca 

atcctttaga 

aggaatcaat 

tttaaggaac 

taacatacct 

ggggtttctc 



aaattagctg 

aagaatccct 

agcctgggcg 

tgc?iactctt 

tcatgtagta 

acattactgt 

ttctggttag 

tcacagggta 

ttacagtata 

tctttttctg 

aaaacgaaat 

tggaaaagca 

gtatttaatg 

ttataaattt 

gatgtaattt 

gtccatggcg 

aactattcat 

catttcccag 

cctgctgagg 

accagaggcc 

gatacaggtc 

agtacttcca 

acattaccag 

gtcaagtcct 

ttctgtcttt 

attagattct 

agatgctaca 

aactcataac 

ctcctcttca 

tgcagtggac 

gcagcctgtg 

agagcctatc 

aatcacttct 

acjcttttggc 

atccccactt 

attttcttca 

ttaaactacc 

atggattttg 

tgtgaactac 

tatatgtgaa 

acgacttcta 

gttgacccag 

tttttaaaaa 

gctgtttttg 

aagacaaagt 

cttaacaaat 

tatatgtcca 

taacaatcta 

agctaaggaa 

atttatgtgt 

aattcaaaag 

tgtgtacatc 

atatcctaca 

tcaggagcat 

agacacaaaa 

gtattcaagt 

ctttccaaag 

atttat^ata 

cccttgtaaa 

atatatagtc 

agactcaaca 



T/CA2003/001716 
ggtgg 6180 



tgaacccagg 
aaagaggaag 
atttatgatc 
aaaatataat 
cactagtcaa 
aaggcatact 
ataagaatat 
gaactagtcg 
ttgatgtaat 
aattacttac 
gctcttatct 
gatttttact 
agtgtaacca 
cacatatgaa 
catccgtgtg 
gaacgtccag 
cggcgtaaga 
actgttggcc 
gtagttatgt 
catcctatgt 
gaagaagctc 
gcccgggtgg 
atgaaacaga 
gaatttccat 
ggattgggtc 
cgtttgaaat 
aatgagagca 
ggagaaggag 
ttgaatcaag 
actgaagaaa 
ggacagaact 
tcagacccca 
tctgacgtct 
ttcttctgtt 
ggaggatctg 
ctactttgca 
gagaaatatt 
atatatgcat 
aaccagtgga 
tgcacgttat 
cattatttag 
ataggataaa 
aggtaatatt 
atatcatggg 
gaaaatgcct 
aataaaatgc 
gtgagtggtg 
cagtagacat 
atgagcatat 
tacagtcagt 
ttataaaatc 
taaagtacta 
aaacaactcc 
agatccagaa 
tgcagcccct 
taaaaaatac 
gttaagctgc 
ctgaaatgga 
ctgcatcatt 
ctgttgacat 



6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 
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wo 2004/043999 
ctgaggttgg ataattct' ^ 
catccttggt ttatacctcc 
agaaatgtct ccacatattg 
gcaatgatcg ccctaggtac 
aaagtcattg tggcactaca 
aattcttcag ataaatttca 
ctaataacaa attttattat 
agcattgaca gcttaggttt 
gttgtgtggt atgatccttt 
catacataat caggttaatg 
cattgcaaaa gcctgacctc 
cttctttcct tcaggcacag 
gggtttcatc ttcccaaccc 
ttttatgaat tagcagtggc 
gcctgctaag tttttgaaag 
agaaaaaaaa gagacttttc 
acaatttagt agacatacat 
cctttttttt gagacaaagt 
tagctcactg caaccaccgc 
tagctgggac tacaggtgcc 
cggggtttca ccatgctggc 
cctcggcttc ccaaagtgct 
acacataatt tacggtacaa 
agcagtgatt tgtcattacc 
attatttcta atctgtagaa 
cctaagaaca attacttcaa 
agccttgctc tgttgcccag 
catctcccag gttcaagcta 
gcgtgccact atgcccagct 
tagagatggc gtttcaccat 
cacccgcttt ggcctcccaa 
tttgttttgg ggggcttgtt 
atacatccaa cagtaaaact 
ttagttaata gtaattattt 
tagctattct acataaaatt 
ttccccatgt gggtttttta 
tggttttgaa acaaagtttt 
tggtcatgta gtcattcgat 
cacttaaagg cactttctac 
gtgaactgaa ggactgggac 
gactggttaa attcaagtaa 
ttaacagttc ctaattagaa 
tactgagtac taatctccta 
agcaaattgt atgtgctctg 
tggggatatg tatgtatact 
ttcatgttaa gaaagagtgt 
gaattaggta gctacctctc 
aaacaaaaat taatttttgt 
ttgtagaatc acaaaattga 
ttaacttctc caaagtcact 
gattcctaga atagtattgt 
tctgcaaatg tacctcagaa 
aaaaaagaaa ggttgaaaac 
ttgttttacg tactttcagg 
ttagtgagga aagaaatata 
atagtaaatg aggaattgct 
agatcttagc aaggtaccaa 
aaagttgtga actttgtcag 
acttgagccc aggagttcaa 
gggcaacaga gcatgactca 
cctactatcc ataaagtctg 



ttgtggtgg 
taaatactag 
ccaagtgccc 
ttggcagttg 
agaagtatta 
gattgatagt 
gaatctgacg 
tcactgtcag 
tctaaatttt 
tttcattttg 
tatacactta 
gggcagagcc 
atacaacatc 
cagaacacta 
gaaaattaag 
tgtgtttcag 
tattactgat 
ctcactctgt 
ctcccaggtt 
tgccaccaca 
caggctggtc* 
gggattacag 
agagtatgtc 
ttcattgtgc 
tgatgtgtca 
gaacatttat 
gctggagtgc 
ttctcacgcc 
aatttttgta 
gttggccagg 
agtgctggga 
ccagaaagtt 
ggtaattcta 
gcaagggaat 
tgtagccact 
atcagcaaac 
cggttcttaa 
actggtttaa 
agaaacagtt 
ttatatgctt 
acatattttt 
tgcaaattac 
acttgcttat 
ttttttcttt 
tactatcaaa 
gagaaaagaa 
caggagaaaa 
ttcagtgatg 
gccctcgata 
tagataataa 
ttcttccatt 
gctgtcttta 
tttttgtact 
tcccaatccc 
cttttgctaa 
tttttgtgtg 
aaatagctgc 
acacagtgcc 
ggctacagtg 
tctcaaaaaa 
tttgaactgg 



gctgttctat 
tagcagcaat 
ccctgggggt 
atgataacag 
ccaaattagc 
attaatactt 
attctagcca 
tgggttttgc 
tcttacaaaa 
atcaagcaaa 
ctttaggcct 
agggtcagcc 
ctcagcaggt 
ttttaacttt 
aggcatacca 
aaaaatctct 
ataattatct 
cacccaggct 
caagcagtcc 
ctcagctaat 
tcaaactcct 
gcatgcgcca 
attccttttc 
ttgcctctac 
gattttacgc 
ttatttattt 
agtggtgcaa 
tcagccgccc 
ttacgttcta 
ctggtcttga 
gccactgtgc 
caaggaattc 
aaattaaata 
aatgcatttt 
atgtatgtaa 
tctaaactgg 
ttacaaatgg 
gatcaggagt 
gattagtttg 
tgttttgcac 
gccaacctta 
aagtcacata 
ctttggatgt 
tccaaaggaa 
tgaagaagat 
gagagcagta 
aaatatggct 
tttttgtttt 
tatataggaa 
gggagctaca 
ttaaataaag 
tttgttttaa 
atatggatag 
accttagaga 
aatttacttt 
tttttgtagg 
ctcctgctga 
agctacttgg 
aactatgata 
aaaaaagcac 
tttcatgagg 



gtattgtagg 

ctccctcagt 

ggcaggattg 

^ttttttgat 

attaccaaat 

tgttcctgat 

gctaaaatta 

atgggttggg 

gttgacccag 

ctctgaggag 

tctctaattt 

atcctcaagt 

gagaacaatg 

tgtgatttgt 

tgtcttgtgt 

tttgtttaat 

ttttaaaaga 

ggagtgcagt 

tcctgcctca 

ttttgtattt 

gacctcaggt 

cctcacccag 

tctttttttc 

taatcctgca 

taagaagcat 

atttttattt 

tcttggctca 

gagtagctgg 

cttatgcctg 

actcctgacc 

ccagccaaaa 

aagagattag 

tttatagtgc 

tggcataatt 

tgcaaacttc 

cttctgtttc 

ccctttttca 

tgcttgtaca 

tttatacatg 

ctgtcctttc 

acaaaatctg 

aagctggttg 

agacgaacac 

ttggttatct 

aataaaataa 

ccataaatgt 

aaatgctatc 

agatactttt 

gtgtattcgg 

aattataacc 

ttctgcaggg 

ttattggagt 

tctattttta 

accatatctc 

atttcatgcg 

gtctcgtata 

aaagctttgt. 

aaggctgagg 

ttgccaccat 

cgaagttgtg 

caagcttggg 



, yCA2003/001716 
tagaa 9840 



tgtgacaacc 
dtcctggata 
acttttgata 
acctttcttt 
tattgtattt 
cagcctgagc 
atcagaacat 
aggcctctcc 
attaaatatt 
gttgtgttca 
agcttaacag 
cattgcaaca 
tatataaatg 
tcagtaacaa 
atacttttga 
ggattctaca 
ggtgcaatct 
gctttccacg 
ttagtagaga 
aatccaccct 
cctggattct 
cactgcttgc 
cttcttttta 
ctaatgaaaa 
attttgagac 
ctgcaacttc 
gactacaggc 
gctaatttag 
tcaagtgatc 
tattttaact 
tgtcatcctg 11700 
atcactgata 11760 
gaatacttat 
ctaatgttag 
tattagtttt 
gtccaacata 
agatgaaaat 
cattcactcc 
ttccaaacaa 
ttttattcta 
agaatggaag 
caatgaactg 
gtcttgtatt 
ttcattgagt 
cagattaaaa 
tctacagcag 
atagcatgtg 
tctagaataa 
cagaattcaa 
acttaatgat 
tccacgtttg 
gacatgattt 
ttgttttatt 
tttcttgaac 
gtgatatact 
gaccacaaac 
cagga agate 13260 
actccagcct 13320 
aactacctac 
cagagcatga 



9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 



11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 



13380 
13440 
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tgctaaactg ttaagct^^p::cacccctgc ctggctgaag tgctgattat tipibagtgg 13500 
gactgcgcac agcagatcat catgtacttc tggcatatcc agaggt 13546 



<210> 13 

<211> 509 

<212> PRT 

<213> Tetraodon nigroviridis 
<220> 

<223> Partial amino acid sequence 

<220> 

<221> misc^feature 

<222> (326).. (397) 

<223> X = undetermined amino acid 

<400> 13 

Leu Cys His Ala Leu Ser Phe lie Ser Ser Gly Leu Cys Trp Gly Phe 
15 10 15 

Ser Leu Phe His Ser Glu Cys His Ser Glu He His Cys lie Ser Leu 
20 25 30 

Ser Gin Gin His Ser Met Pro His Pro Tyr Ser Gly Gly Ala Gly Gly 
35 40 . 45 

Gly Ala Gly Gly Asn Val Pro Leu Ala Ser Leu Pro Asp Pro Thr TVla 
50 55 60 

Val Leu Phe Pro Ser He Pro Glu Arg Pro He Ser Phe Ser Pro Pro 
^5 70 75 80 

Pro Thr Met Pro Pro Lys Ala Tyr Asn Thr Gin Arg Arg Lys Ser Thr 
85 90 95 

Ser He Leu Glu Ala His Thr Arg His Phe Gin Pro Ala Tyr Pro Arg 
100 105 110 

Tyr Gly Ser Asn Leu His Pro Phe Ser Gly Met Glu Ala Val Glu Pro 
115 120 i25 

Pro Pro Leu Phe Met Val Asn Pro Gly Phe Pro Ser Ala Ala Gin Arg 
' 130 135 140 

Leu Gly Val Gly Glu Pro Leu His Leu Pro Gly Gin Ser Asp Pro Ser 
^45 150 155 160 

Leu Tyr Gly Tyr Lys Asp Met Arg Ala Glu His Gly Glu Ala Val Arg 
165 170 175 

Arg Leu Ser Gin Ala Ala Leu Leu Asp His Tyr Glu Ala Met Ala Tyr 
180 185 190 

Gly Gly Tyr Pro Met Ala Ala His Gin Leu Gly Arg Leu Thr Phe His 
195 200 205 

Gin Gin Arg His Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala 
210 215 220 

Ser Phe Gly Phe Asp Gin Pro Gly Phe Leu His Pro His Leu Val Gin 
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wo 2004/043999 ^ ^■t/CA2003/001716 

225 235 240 

Arg Met Ser Ala His Ser Pro Val Pro Pro Pro Leu Ala Pro Leu Ser 
245 250 255 

Ala Pro Ser Gly Thr Ser lie Ser Ser Ser Ser Glu Gly Cys Tyr Pro 
260 265 270 

Pro Pro His Ala Ala Ala Ser Pro Phe Ala Val Ser Ala Pro Pro Val 
275 280 285 

Met Ala Asp Ser Ala Ala Pro Pro Thr Gly Asn Val Phe Glu Phe His 
290 295 300 

Leu Ala Ala Ala Gly Asp Pro Asn Leu Met Ala Ser Arg Leu Phe Arg 
305 310 315 320 

Ala Arg Arg Ser Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 335 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
340 345 350 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
355 360 365 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Gin Ser 
385 390 395 400 

Pro I!ro Pro Pro Pro Leu Leu Pro Ser Phe Asp Ser Ser Cys Ala Gin 
405 410 415 

Ser lie Pro His Ser Thr Ser Thr Ala Thr Tyr Asp Pro Pro Gly Gly 
420 425 430 

Met Phe Pro lie Asp Pro Gin Gly His Pro Pro Ser Met Gin Ser Phe 
435 440 445 

Leu Pro Pro Thr Pro Ser Ser Glu Leu Pro Leu Gly Gly Ala Pro Ser 
450 455 460 

Thr His Leu Glu Thr Leu Met Gin Ser Ala Trp Ala Arg His Gly Gly 
465 470 475 480 

Val Val Pro Ala Gin Pro Asp Met Thr Tyr His Glu Ser Leu Leu Ala 
485 490 495 

Met Gin Ala Val Lys Pro Ser Leu Val Leu Leu Pro Phe 
500 505 



<210> 14 

<211> 2785 

<212> DNA 

<213> Homo sapiens 

<400> 14 
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wo 2004/043999 

cgtggtggcg cacgcgt^|^^tcccagcta 

aacccaggag gcggag'gttg cagtaagcca 
agaggaagac tccatctcaa aaaaaagaaa 
ttatgatcag aaaaatggac attttgtgat 
aatataataa aactattaat catctagctt 
ctagtcaaat tatatatctt ttactatcca 
ggcatactat taattgataa gaaaataaaa 
aagaatatag ggaaagttag ttcaatagtt 
actagtcggg cttttatgcc ttgttttagt 
gatgtaattt acattaatgc ttaagagtga 
ttacttacaa agtttaattc ttccatttcc 
tcttatctaa agcaaagagc ccacagattg 
tttttactat gcacataatt tccagaagca 
tgtaaccatt tcatagggtt acacagaact 
catatgaatg tatgaattac ttgtcttatt 
tccgtgtggg gggaccccaa catacccaga 
acgtccagtt tctttttcac cacctcccac 
gcgtaagagc acctccttcc tggaagccca 
tgttggccaa agtcttcttc cacctggtgg 
agttatgttg ggtactacag ccagtagagt 
tcctatgttt gaaccatctc aagtttacag 
agaagctcac tattttattc ctcaggaagc 
ccgggtggca gaacagtatg agggcattcc 
gaaacagata cctgaacaga agccagtaca 
tttccatctg gacaggcttt cctggtagga 
ttgggtccgg gatctcccct ctctagtatt 
ttgaaatttc accctgtctt tgttcctcat 
gagagcagaa gcaactgtgt atttgaattt 
gaaggaggtg gaattttacc tcagcgtgtt 
aatcaagaag aactgcctcc tcaatcagtt 
gaagaaaagc ataattacca tgccccagaa 
cagaactggc caataggaag cccagaatat 
gaccccagtg attttcagtc acctccccct 
gacgtctcaa tgccctttat ccatctgcct 
ttctgtttcc cccaaggaac cacatctcag 
ggatctgcac ttcatccaca ggttatagga 
ctttgcacca taacatttaa attttctatt 
aaatattgtt taattttatc agtagagttt 
tatgcattta aaaacaaaat gtgagagaag 
cagtggaaaa aacacaaaaa ctagaatttt 
acgttatttt caaccagtag tgaaaatgca 
tatttaggaa tacaaatctt aagtattact 
ggataaattt tttttctata aaatataaaa 
taatattaat aatacacaga attttcatta 
tcatgggaat aaaaaaagat agaaaaggaa 
aatgcctttt taaaatggca tcagtcaagc 
aaaatgctaa ttcataaaat taagg 



cttgggaggc 

agattgtcac 

aaagaaattt 

ttaactctgt 

gggagagata 

ccaaaaatct 

ctgaaggcct 

taaattaaag 

tcttactctt 

actttttaag 

tttgagagag 

atttcattgg 

ttgttattta 

acccagttgt 

catgttgata 

atcacagata 

ctgcccaccg 

aactcaccac 

cagcccaact 

aactggagag 

tgactataga 

agtgtatgta 

atacaactca 

scrggggccct 

caccttcaga 

tctgcaccta 

tctgcgcct^ 

catgttcaca 

taccgaaatc 

ggattacatg 

ttgaccgttt 

tccagtgatt 

acagggggag 

cagacagtgt 

caggtcttaa 

aaacttccac 

ccttatttcc 

ccccatcttt 

ctacctgatt 

agtcattctt 

agtgtatgta 

ttcttcctcc 

catggaaaat 

gtgtcgaagg 

acagtttagg 

aagttgctgt 



tgaggcagaa 

tgcactccag 

catggttatg 

aacatgtttc 

ggagaaagac 

cttctgattt 

ctaacatatc 

cacacttctt 

cctttaactc 

tgtgggtaaa 

gaaagttatg 

ccctggatgt 

tttattaatt 

gcatgtctga 

cagcctcagt 

tttttcccaa 

aaagtagcca 

ttccaacccc 

aactggacac 

tcatgtgaga 

cctggactag 

gctggggtac 

tcagtactgt 

acttcaagtt 

atctaagatt 

tcagtacaga 

ctgtgttaac 

caccaagctc 

ggcaggttgc 

gctacttgca 

ctgtggtaga 

cctcacaaat 

cagctgcacc 

tacaagaatc 

ctgcctcatt 

aattatttta 

Gtgaatcatg 

ggggggttgt 

tacctattat 

cacaatcacg 

atgtatggtt 

aaacaagttt 

agggaatgct 

atctaaaaag 

aatttgcctt 

gcattattat 



ga^c 



T/CA2003/001716 



gsE^Tcccttg 
cctgggcgaa 
caactcttat 
atgtagtaaa 
attactgtca 
ctggttagaa 
acagggtaat 
acagtataga 
tttttctgtt 
aacgaaataa 
gaaaagcagc 
atttaatgga 
ataaatttag 
tgtaatttca 
ccatggcgca 
ctattcatga 
tttcccagcg 
tgctgaggac 
cagaggccgt 
tacaggtcca 
tacttccaga 
attaccaggc 
caagtcctat 
ctgtctttga 
agattctgga 
tgctacacgt 
tcataacaat 
ctcttcagga 
agtggacttg 
gcctgtgact 
gcctatcgga 
cacttcttca 
ttttggctct 
cccacttttc 
ttcttcagga 
aactacccta 
gattttggag 
gaactacata 
atgtgaaaac 
acttctatgc 
gacccagcat 
ttaaaaaata 
gtttttgagg 
acaaagtata 
aacaaatgaa 
atgtccaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 • 
2760 
2785 



<210> 15 

<211> 2787 

<212> DNA 

<213> Homo sapiens 



<400> 15 

cgtggtggcg 

aacccaggag 

agaggaagac 

ttatgatcag 

aatataataa 

ctagtcaaat 



cacgcgtgta 
gcggaggttg 
tccatctcaa 
aaaaatggac 
aactattaat 
tatatatctt 



gtcccagcta 
cagtaagcca 
aaaaaagaaa 
attttgtgat 
catctagctt 
ttactatcca 



cttgggaggc 
agattgtcac 
aaagaaattt 
ttaactctgt 
gggagagata 
ccaaaaatct 



tgaggcagaa 
tgcactccag 
catggttatg 
aacatgtttc 
ggagaaagac 
cttctgattt 



gaatcccttg 
cctgggcgaa 
caactcttat 
atgtagtaaa 
attactgtca 
ctggttagaa 



60 
120 
180 
240 
300 
360 



20 
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ggcatactat taattga! 
aagaatatag ggaaagttag 
actagtcggg cttttatgcc 
gatgtaattt acattaatgc 
ttacttacaa agtttaattc 
tcttatctaa agcaaagagc 
tttttactat gcacataatt 
tgtaaccatt tcatagggtt 
catatgaatg tatgaattac 
tccgtgtggg gggaccccaa 
acgtccagtt tctttttcac 
gcgtaagagc acctccttcc 
tgttggccaa agtcttcttc 
agttatgttg ggtactacag 
tcctatgttt gaaccatctc 
agaagctcac tattttattc 
ccgggtggca gaacagtatg 
gaaacagata cctgaacaga 
atttccatct ggacaggctt 
attgggtccg ggatctcccc 
tttgaaattt caccctgtct 
tgagagcaga agcaactgtg 
agaaggaggt ggaattttac 
gaatcaagaa gaactgcctc 
ctgaagaaaa gcataattac 
gacagaactg gccaatagga 
cagaccccag tgattttcag 
ctgacgtctc aatgcccttt 
tcttctgttt cccccaagga 
gaggatctgc acttcatcca 
tactttgcac cataacattt 
agaaatattg tttaatttta 
tatatgcatt taaaaacaaa 
accagtggaa aaaacacaaa 
gcacgttatt ttcaaccagt 
attatttagg aatacaaatc 
taggataaat tttttttcta 
ggtaatatta ataatacaca 
tatcatggga ataaaaaaag 
aaaatgcctt tttaaaatgg 
ataaaatgct aattcataaa 



gaaaataaaa 
ttcaatagtt 
ttgttttagt 
ttaagagtga 
ttccatttcc 
ccacagattg 
tccagaagca 
acacagaact 
ttgtcttatt 
catacccaga 
cacctcccac 
tggaagccca 
cacctggtgg 
ccagtagagt 
aagtttacag 
ctcaggaagc 
agggcattcc 
agccagtaca 
tcctggtagg 
tctctagtat 
ttgttcctca 
tatttgaatt 
ctcagcgtgt 
ctcaaatcag 
catgccccag 
agcccagaat 
tcacctccQC 
atccatctgc 
accacatctc 
caggttatag 
aaattttcta 
tcagtagagt 
atgtgagaga 
aactagaatt 
agtgaaaatg 
ttaagtatta 
taaaatataa 
gaattttcat 
atagaaaagg 
catcagtcaa 
attaagg 



ctgaaggcct 
taaattaaag 
tcttactctt 
actttttaag 
tttgagagag 
atttcattgg 
ttgttattta 
acccagttgt 
catgttgata 
atcacagata 
ctgcccaccg 
aactcaccac 
cagcccaact 
aactggagag 
tgactataga 
agtgtatgta 
atacaactca 
agggggccct 
acaccttcag 
ttctgcacct 
ttctgcgcct 
tcatgttcac 
ttaccgaaat 
ttggattaca 
aattgaccgt 
attccagtga 
ctacaggggg 
ctcagacagt 
agcaggtctt 
gaaaacttcc 
ttccttattt 
ttccccatct 
agctacctga 
ttagtcattc 
caagtgtatg 
ctttcttcct 
aacatggaaa 
tagtgtcgaa 
aaacagttta 
gcaagttgct 



ctaacatatc 
cacacttctt 
cctttaactc 
tgtgggtaaa 
gaaagttatg 
ccctggatgt 
tttattaatt 
gcatgtctga 
cagcctcagt 
tttttcccaa 
aaagtagcca 
ttccaacccc 
aactggacac 
tcatgtgaga 
cctggactag 
gctggggtac 
tcagtactgt 
acttcaagtt 
aatctaagat 
atcagtacag 
gctgtgttaa 
acaccaagct 
cggcaggttg 
tggctacttg 
ttctgtggta 
ttcGtcacaa 
agcagctgca 
gttacaagaa 
aactgcctca 
acaattattt 
ccctgaatca 
ttggggggtt 
tttacctatt 
ttcacaatca 
taatgtatgg 
ccaaacaagt 
atagggaatg 
ggatctaaaa 
ggaatttgcc 
gtgcattatt 



# 



T/CA2003/001716 
ggtaat 420 



acagtataga 
tttttctgtt 
aacgaaataa 
gaaaagcagc 
atttaatgga 
ataaatttag 
tgtaatttca 
ccatggcgca 
ctattcatga 
tttcccagcg 
tgctgaggac 
cagaggccgt 
tacaggtcca 
tacttccaga 
attaccaggc 
caagtcctat 
ctgtctttga 
tagattctgg 
atgctacacg 
ctcataacaa 
cctcttcagg 
cagtggactt 
cagcctgtga 
gagcctatcg 
atcacttctt 
ccttttggct 
tccccacttt 
ttttcttcag 
taaactaccc 
tggattttgg 
gtgaactaca 
atatgtgaaa 
cgacttctat 
ttgacccagc 
ttttaaaaaa 
ctgtttttga 
agacaaagta 
ttaacaaatg 
atatgtccaa 



480 

540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2787 



<^10> 16 

<211> 2786 

<212> DNA 

<213> Homo sapiens 



<400> 16 

cgtggtggcg 

aacccaggag 

agaggaagac 

ttatgatcag 

aatataataa 

ctagtcaaat 

ggcatactat 

aagaatatag 

actagtcggg 

gatgtaattt 

ttacttacaa 

tcttatctaa 



cacgcgtgta 
gcggaggttg 
tccatctcaa 
aaaaatggac 
aactattaat 
tatatatctt 
taattgataa 
ggaaagttag 
cttttatgcc 
acattaatgc 
agtttaattc 
agcaaagagc 



gtcccagcta 
cagtaagcca 
aaaaaagaaa 
attttgtgat 
catctagctt 
ttactatcca 
gaaaataaaa 
ttcaatagtt 
ttgttttagt 
ttaagagtga 
ttccatttcc 
ccacagattg 



cttgggaggc 
agattgtcac 
aaagaaattt 
ttaactctgt 
gggagagata 
ccaaaaatct 
ctgaaggcct 
taaattaaag 
tcttactctt 
actttttaag 
tttgagagag 
atttcattgg 



tgaggcagaa 
tgcactccag 
catggttatg 
aacatgtttc 
ggagaaagac 
cttctgattt 
ctaacatatc 
cacacttctt 
cctttaactc 
tgtgggtaaa 
gaaagttatg 
ccctggatgt 



gaatcccttg 
cctgggcgaa 
caactcttat 
atgtagtaaa 
attactgtca 
ctggttagaa 
acagggtaat 
acagtataga 
tttttctgtt 
aacgaaataa 
gaaaagcagc 
atttaatgga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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tttttactat gcacati 

tgtaaccatt tcataggg^ 
catatgaatg tatgaattac 
tccgtgtggg gggaccccaa 
acgtccagtt tctttttcac 
gcgtaagagc acctccttcc 
tgttggccaa agtcttcttc 
agttatgttg ggtactacag 
tcctatgttt gaaccatctc 
agaagctcac tattttattc 
ccgggtggca gaacagtatg 
gaaacagata cctgaacaga 
atttccatct ggacaggctt 
attgggtccg ggatctcccc 
tttgaaattt caccctgtct 
tgagagcaga agcaactgtg 
agaaggaggt ggaattttac 
gaatcaagaa gaactgcctc 
tgaagaaaag cataattacc 
acagaactgg ccaataggaa 
agaccccagt gattttcagt 
tgacgtctca atgcccttta 
cttctgtttc ccccaaggaa 
aggatctgca cttcatccac 
actttgcacc ataacattta 
gaaatattgt ttaattttat 
atatgcattt aaaaacaaaa 
ccagtggaaa aaacacaaaa 
cacgttattt tcaaccagta 
ttatttagga atacaaatct 
aggataaatt ttttttctat 
gtaatattaa taatacacag 
atcatgggaa taaaaaaaga 
aaatgccttt ttaaaatggc 
taaaatgcta attcataaaa 



tccagaagca 
acacagaact 
ttgtcttatt 
catiacccaga 
cacctcccac 
tggaagccca 
cacctggtgg 
ccagtagagt 
aagtttacag 
ctcaggaagc 
agggcattcc 
agcpagtaca 
tcctggtagg 
tctctagtat 
ttgttcctca 
tatttgaatt 
ctcagcgtgt 
ctcaatcggt 
atgccccaga 
gcccagaata 
cacctccccc 
tccatctgcc 
ccacatctca 
aggttatagg 
aattttctat 
cagtagagtt 
tgtgagagaa 
actagaattt 
gtgaaaatgc 
taagtattac 
aaaatataaa 
aattttcatt 
tagaaaagga 
atcagtcaag 
ttaagg 



ttgttattta 
acccagttgt 
catgttgata 
atcacagata 
ctgcccaccg 
aactcaccac 
cagcccaact 
aactggagag 
tgactataga 
agtgtatgta 
atacaactca 
agggggccct 
acaccttcag 
ttctgcacct 
ttctgcgcct 
tcatgttcac 
ttaccgaaat 
tggattacat 
attgaccgtt 
ttccagtgat 
tacaggggga 
tcagacagtg 
gcaggtctta 
aaaacttcca 
tccttatttc 
tccccatctt 
gctacctgat 
tagtcattct 
aagtgtatgt 
tttcttcctc 
acatggaaaa 
agtgtcgaag 
aacagtttag 
caagttgctg 



tttattaatt 
gcatgtctga 
cagcctcagt 
tttttcccaa 
aaagtagcca 
ttccaacccc 
aactggacac 
tcatgtgaga 
cctggactag 
gctggggtac 
tcagtactgt 
acttcaagtt 
aatctaagat 
atcagtacag 
gctgtgttaa 
acaccaagct 
cggcaggttg 
ggctacttgt 
tctgtggtag 
tcctcacaaa 
gcagctgcac 
ttacaagaat 
actgcctcat 
caattatttt 
cctgaatcat 
tggggggttg 
ttacctatta 
tcacaatcac 
aatgtatggt 
caaacaagtt 
tagggaatgc 
gatctaaaaa 
gaatttgcct 
tgcattatta 



T/CA2003/001716 
at t tag 780 



tgtaatttca 
ccatggcgca 
ctattcatga 
tttcccagcg 
tgctgaggac 
cagaggccgt 
tacaggtcca 
tacttccaga 
attaccaggc 
caagtcctat 
ctgtctttga 
tagattctgg 
atgctacacg 
ctcataacaa 
cctcttcagg 
cagtggactt 
agcctgtgac 
agcctatcgg 
tcacttcttc 
cttttggctc 
ccccactttt 
tttcttcagg 
aaactaccct 
ggattttgga 
tgaactacat 
tatgtgaaaa 
gacttctatg 
tgacccagca 
tttaaaaaat 
tgtttttgag 
gacaaagtat 
taacaaatga 
tatgtccaaa 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2786 



<210> 17 
<211> 2365 
<212> DNA 
<213> Bus scrofa 

<400> 17 

gaattcggca cgagggtatc caccaaaaat ctcttctgat ttctaggtag aagacacact 60 

attagttaat cagaaattaa ctctggagcc tctaacagca cacaacaata aagcttcaga 120 

ataggaaaac ttggtacagt ctcattctta cttaactctg tttccactga tgcagtttac 180 

attaagtcat gagtgaacat attaattgtg ggtaaaagca gaataatcac ttaggaagtt 240 

aaatccttcc atttcccttg agagaggaga attgtggaaa agcagctcct atctaatgca 300 

aagagtccca cagaataatt ttattgaccc tggatgtatt taatggattt ttttttttta 360 

ctgtgcacat aatttctgaa agcattgtta tttctttatt aattataaat ttggtgtaac 420 

catttcgtag ggttacacag agctacccag ttatgcatgt ctaatgtaat ttcacatatg 480 

aatgtatgaa ttacttgtct tattcatgtt gatacagcct cagtccatgg cgcatccgcg 540 

tggggggacc ccaacatacc cagaatcaca gatatttttc ccaactattc atgaacgtcc 600 

agtttctttt tcaccacctc ccacctgtcc accgaaggtg gctatttccc agcggcgtaa 660 

gagcacctcc ttcctggaag cccaaactca tcacttccaa cccctgctga ggactgttgg 720 

ccaaaatctt cttccacctg gtggctgccc aactaactgg acaccagagg ccgtagttat 780 

gttgggtact acagccagta gagtaactgg agagccatgt gagatacagg tccaacctct 840 

gtttgagcct actcaagttt atggtgactg tagacctgga ctagtacttc cagaagaagc 900 

tcactacttt attcctcagg aagcagtgta tgtagcaggg gtacattacc agacccaaat 960 

ggcagaacag tttgagggta ttccatacaa ctctccagtc ctgtcaagtc ctatgaaaca 1020 

gatacctgaa cagaagccag tgcaaggggg ccctccttcg agttctgtct ttgaatttcc 1080 
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atctggacag gctttccl^M^aggacatct 

tccaggatct cccctctcta gcatttctac 

atttcaccct gtctttgttc ctcattctgc 

cagaagcaat tgtgtatttg agtttcatgt 

aggagtttta cctcagcgta tttaccgaaa 

agaaccacct cctcaatcag ctggattaca 

acataatttc cagcccccag aattgaccgt 

gccaatagga agcccagaat attccagtga 

tgattttcaa tcacctcccc ctacaggggg 

attacccttt atccatctgc ctcagacagt 

cccccaagga accacttctc cgcagatttt 

actccatcca caggttatag gaaaacttcc 

cattgcattt aagttttctg tccctccttt 

tacaatttct ccattttgtg tgtgtgttgg 

aacaaaatct gagagcaagc ctgcctgatt 

gaaactagaa ttttagttac ttgtcaaaat 

tcaactagaa gtgaaatgta aatgtctgta 

actaatgtga agtatttctt tcttcctaga 

taacattttt ttttcctgta aaatgaaaag 

gtagtattag gagtacatac agaaattaaa 

cctttttaaa atggcatcaa acaaatcaat 

ccataaaaaa aaaaaaaaaa aaaaa 



tcagaattta 
acctatcagt 
acctgctgtg 
tcatactcca 
tcgacaggtt 
cggccgcctg 
ttctgtggta 
ttcctcacaa 
aacagctgca 
gatacaagaa 
atctgcatca 
acagttctct 
ccattcatga 

gggggttatt 

tacctataat 
cagtgacttc 
aagttgactc 
aatgaacgtg 
aatgggaagt 
agagaggaaa 
ttgtattatg 



cgattagatt 
acagatgcta 
ttaactcata 
agctcctctt 
gccgtggact 
cagcctgtga 
gagcctactg 
atcacttctt 
ccttttggct 
tccccacttt 
ttttcttcag 
taaactaccc 
ggattattgt 
ttaaactact 
caaaccattg 
tgttcatatg 
agaattgatg 
gctagttttt 
ggagaatcct 
catcttaata 
tgttcaaata 



T/CA2003/001716 

ctaag 114 U 



cacgtttgaa 
acaatgagag 
caggagaagg 
tgaatcagga 
ctgaagaaca 
gacagagctg 
cagaccccag 
ctgacgtctc 
tcttctgttt 
gaggatctgc 
tgctgtgcac 
ttaattttat 
tgtatttaag 
gaaaagaaaa 
tagaacgttt 
cttcacacat 
taagagaaga 
ttttctaaaa 
agtgaagttg 
aaatgttaat 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2365 



<210> 18 

<211> 433 

<212> PRT 

<213> Sus scrofa 

<400> 18 

Met Tyr Glu Leu Leu Val Leu Phe Met Leu lie Gin Pro Gin Ser Met 

1 5 . 10 15 

Ala His Pro Arg Gly Gly Thr Pro Thr Tyr. Pro Glu Ser Gin lie Phe 
20 .25 30 

Phe Pro Thr lie His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro Thr 
35 40 45 

Cys Pro Pro Lys Val Ala He Ser Gin Arg. Arg Lys Ser Thr Ser Phe 
50 55 - 60 . 

Leu Glu Ala Gin Thr His His Phe Gin Pro Leu Leu. Arg Thr Val Gly 
65 70 75 80 

Gin Asn Leu Leu Pro Pro Gly Gly Cys Pro Thr Asn Trp Thr Pro Glu 
85 90 95 

Ala Val Val Met Leu Gly Thr Thr Ala Ser Arg Val Thr Gly Glu Pro 
100 105 110 

Cys Glu He Gin Val Gin Pro Leu Phe Glu Pro Thr Gin Val Tyr Gly 
115 120 125 

Asp Cys Arg Pro Gly Leu Val Leu Pro Glu Glu Ala His Tyr Phe He 
130 135 140 

Pro Gin Glu Ala Val Tyr Val Ala Gly Val His Tyr Gin Thr Gin Met 
145 150 155 ^ 160 

Ala Glu Gin Phe Glu Gly He Pro Tyr Asn Ser Pro Val Leu Ser Ser 
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i6l^F 170 lir 

Pro Met Lys Gin lie Pro Glu Gin Lys Pro Val Gin Gly Gly Pro Pro 
180 185 190 

Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val Gly 
195 200 205 

His Leu Gin Asn Leu Arg Leu Asp Ser Gly Leu Ser Pro Gly Ser Pro 
210 215 220 

Leu Ser Ser lie Ser Thr Pro lie Ser Thr Asp Ala Thr Arg Leu Lys 
225 230 235 240 

Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr His 
245 250 255 

Asn Asn Glu Ser Arg Ser Asn Cys Val Phe Glu Phe His Val His Thr 
260 265 270 

Pro Ser Ser Ser Ser Gly Glu Gly Gly Val Leu Pro Gin Arg He Tyr 
275 280 285 

Arg Asn Arg Gin Val Ala Val Asp Leu Asn Gin Glu Glu Pro Pro Pro 
290 295 300 

Gin Ser Ala Gly Leu His Gly Arg Leu Gin Pro Val Thr Glu Glu Gin 
305 310 315 320 

His Asn Phe Gin Pro Pro Glu Leu Thr Val Ser Val Val Glu Pro Thr 
325 330 335 

Gly Gin Ser Trp Pro He Gly Ser Pro Glu Tyr Ser Ser Asp Ser Ser 
340 345 350 

Gin He Thr Ser Ser Asp Pro Ser Asp Phe Gin Ser Pro Pro Pro Thr 
355 360 365 

Gly Gly Thr Ala Ala Pro Phe Gly Ser Asp Val Ser Leu Pro Phe He 
370 375 380 

His Leu Pro Gin Thr Val He Gin Glu Ser Pro Leu Phe Phe Cys Phe 
385 390 395 400 

Pro Gin Gly Thr Thr Ser Pro Gin He Leu Ser Ala Ser Phe Ser Ser 
405 410 415 

Gly Gly Ser Ala Leu His Pro Gin Val He Gly Lys Leu Pro Gin Phe 
420 425 430 

Ser 



<210> 19 

<211> 426 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Met Leu He Gin Pro Gin Ser Met Ala His Pro Cys Gly Gly Thr Pro 
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1 



5 




10 



IT/CA2003/001716 



Thr Tyr Pro Glu Ser Gin lie Phe Phe Pro Thr lie His Glu Arg Pro 
20 25 30 

Val Ser Phe Ser Pro Pro Pro Thr Cys Pro Pro Lys Val Ala He Ser 
35 40 45 

Gin Arg Arg Lys Ser Thr Ser Phe Leu Glu Ala Gin Thr His His Phe 
50 55 60 

Gin Pro Leu Leu Arg Thr Val Gly Gin Ser Leu Leu Pro Pro Gly Gly 
65 70 75 80 

Ser Pro Thr Asn Trp Thr Pro Glu Ala Val Val Met Leu Gly Thr Thr 
85 90 95 

Ala Ser Arg Val Thr Gly Glu Ser Cys Glu He Gin Val His Pro Met 
100 105 110 • 

Phe Glu Pro Ser Gin Val Tyr Ser Asp Tyr Arg Pro Gly Leu Val Leu 
115 120 125 

Pro Glu Glu Ala His Tyr Phe He Pro Gin GlU Ala Val Tyr Val Ala 
130 135 140 

Gly Val His Tyr Gin Ala Arg Val Ala Glu Gin Tyr Glu Gly He Pro 
145 150 155 160 

Tyr Asn Ser Ser Val Leu Ser Ser Pro Met Lys Gin He Pro Glu Gin 
165 170 175 

Lys Pro Val Gin Gly Gly Pro Thr Ser Ser Ser Val Phe Glu Phe Pro 
180 185 190 

Ser Gly Gin Ala Phe Leu Val Gly His Leu Gin Asn Leu Arg Leu Asp 
195 200 205 

Ser Gly Leu Gly Pro Gly Ser Pro Leu Ser Ser He Ser Ala Pro He 
210 215 220 

Ser Thr Asp Ala Thr Arg Leu Lys Phe His Pro Val Phe Val Pro His 
225 230 235 240 

Ser Ala Pro Ala Val Leu Thr His Asn Asn Glu Ser Arg Ser Asn Cys 
245 250 255 

Val Phe Glu Phe His Val His Thr Pro Ser Ser Ser Ser Gly Glu Gly 
260 265 270 

Gly Gly He Leu Pro Gin Arg Val Tyr Arg Asn Arg Gin Val Ala Val 
275 280 285 

Asp Leu Asn Gin Glu Glu Leu Pro Pro Gin Ser Val Gly Leu His Gly 
290 295 300 

Tyr Leu Gin Pro Val Thr Glu Glu Lys His Asn Tyr His Ala Pro Glu 
305 310 315 ^ . 320 

Leu Thr Val Ser Val Val Glu Pro He Gly Gin Asn Trp Pro He Gly 



325 



330 



335 
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Ser Pro Glu Tyr Ser ser Asp Ser Ser Gin lie Thr Ser Ser Asp Pro 
340 345 350 

Ser Asp Phe Gin Ser Pro Pro Pro Thr Gly Gly Ala Ala Ala Pro Phe 
355 360 365 

Gly Ser Asp Val Ser Met Pro Phe lie His Leu Pro Gin Thr Val Leu 
370 375 380 

Gin Glu Ser Pro Leu Phe Phe Cys Phe Pro Gin Gly Thr Thr Ser Gin 
385 390 395 400 

Gin Val Leu Thr Ala Ser Phe Ser Ser Gly Gly Ser Ala Leu His Pro 
405 410 415 

Gin Val lie Gly Lys Leu Pro Gin Leu Phe 
420 425 



<210> 20 

<211> 419 

<212> PRT 

<213> Homo sapiens 

<400> 20 

Met Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin lie 
1 5 . 10 15 

Phe Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro 
20 25 30 

Thr Cys Pro Pro Lys Val Ala He Ser Gin Arg Arg Lys Ser Thr Ser 
35 40 45 

Phe Leu Glu Ala Gin Thr His His Phe Gin Pro Leu Leu Arg Thr Val 
50 55 60 

Gly Gin Ser Leu Leu Pro Pro Gly Gly Ser Pro Thr Asn Trp Thr Pro 
•65 - 70- 75 *80 

Glu Ala Val Val Met Leu Gly Thr Thr Ala Ser Arg Val Thr Gly Glu 
85 90 95 

Ser Cys Glu He Gin Val His Pro Met Phe Glu Pro Ser Gin Val Tyr 
100 105 110 

Ser Asp Tyr Arg Pro Gly Leu Val Leu Pro Glu Glu Ala His Tyr Phe 
115 ' 120 - 125 

He Pro Gin Glu Ala Val Tyr Val Ala Gly Val His Tyr Gin Ala Arg 
130 135 140 

Val Ala Glu Gin Tyr Glu Gly He Pro Tyr Asn Ser Ser Val Leu Ser 
145 150 155 160 

Ser Pro Met Lys Gin He Pro Glu Gin Lys Pro Val Gin Gly Gly Pro 
165 170 ^ 175 

Thr Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val 
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180 185 190 ^^ 



Gly His Leu Gin Asn Leu Arg Leu Asp Ser Gly Leu Gly Pro Gly Ser 
195 200 205 

Pro Leu Ser Ser lie Ser Ala Pro lie Ser Thr Asp Ala Thr Arg Leu 
210 215 220 

Lys Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr 
225 230 235 240 

His Asn Asn Glu Ser Arg Ser Asn Cys Val Phe Glu Phe His Val His 
245 250 255 

Thr Pro Ser Ser Ser Ser Gly Glu Gly Gly Gly lie Leu Pro Gin Arg 
260 265 270 

Val Tyr Arg Asn Arg Gin Val Ala Val Asp Leu Asn Gin Glu Glu Leu 
275 280 285 

Pro Pro Gin Ser Val Gly Leu His Gly Tyr Leu Gin Pro Val Thr Glu 
290 295 300 

Glu Lys His Asn Tyr His Ala Pro Glu Leu Thr Val Ser Val Val Glu 
305 310 315 320 

Pro lie Gly Gin Asn Trp Pro lie Gly Ser Pro Glu Tyr Ser Ser Asp 
325 330 335 

Ser Ser Gin lie Thr Ser Ser Asp Pro Ser Asp Phe Gin Ser Pro Pro 
340 345 350 

Pro Thr Gly Gly Ala Ala Ala Pro Phe Gly Ser Asp Val Ser Met Pro 
355 360 365 

Phe lie His Leu Pro Gin Thr Val Leu Gin Glu Ser Pro Leu Phe Phe 
370 375 380 

Cys Phe Pro Gin Gly Thr Thr Ser Gin Gin Val Leu Thr Ala Ser Phe 
385 390 395 400 

Ser Ser Gly Gly Ser Ala Leu His Pro Gin Val lie Gly Lys Leu Pro 
405 410 415 

Gin Leu Phe 



<210> 21 

<211> 425 

<212> PRT 

<213> Mus musculus 

<400> 21 

Met Leu lie Gin Pro Gin Ser Met Ala His Pro Cys Gly Gly Thr Pro 
15 10 15 

Thr Tyr Pro Glu Ser Gin lie Phe Phe Pro Thr lie His Glu Arg Pro 
20 25 ^ 30 

Val Ser Phe Ser Pro Pro Pro Thr Cys Pro Pro Lys Val Ala lie Ser 
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35 . 40 45 

Gin Arg Arg Lys Ser Thr Ser Phe Leu Glu Ala Gin Thr Arg His Phe 
50 55 60 

Gin Pro Leu Leu Arg Thr Val Gly Gin Asn His Leu Pro Pro Gly Ser 
65 70 75 80 

Ser Pro Thr Asn Trp Thr Pro Glu Ala He Val Met Leu Gly Ala Thr 
85 90 95 

Ala Asn Arg Val Asn Arg Glu Leu Cys Glu Met Gin Val Gin Pro Val 
100 105 110 

Phe Glu Pro Thr Gin He Tyr Ser Asp Tyr Arg Pro Gly Leu Val Leu 
115 120 ■ 125 

Ala Glu Glu Ala His Tyr Phe He Pro Gin Glu Thr Val Tyr Leu Ala 
130 135 140 

Gly Val His Tyr Gin Ala Gin Val Ala Gly Gin Tyr Glu Gly He Ser 
145 150 155 160 

Tyr Asn Ser Pro Val Leu Ser Ser Pro Met Lys Gin He Ser Glu Gin 

165 170 175 

Lys Pro Val Pro Gly Gly Pro Ala Ser Ser Ser Val Phe Glu Phe Pro 
180 185 190 

Ser Gly Gin Ala Phe Leu Val Gly His Leu Gin Asn Leu Arg Leu Asp 
195 200 205 

Ser Gly Pro Ser Pro Ala Ser Pro Leu Ser Ser He Ser Ala Pro Asn 
210 215 220 

Ser Thr Asp Ala Thr His Leu Lys Phe His Pro Val Phe Val Pro His 
225 230 235 - 240 

Ser Ala Pro Ala Val Leu Thr Asn Ser Asn Glu Asn Arg Ser Asn Cys 
245 250 255 

Val Phe Glu Phe His Ala Gin Thr Pro Ser Ser Ser Gly Glu Gly Gly 
260 265 270 

Gly He Leu Pro Gin Arg Val Tyr Arg Asn Arg Gin Val Ala Val Asp 
275 280 285 

Ser Asn Gin Glu Glu Leu Ser Pro Gin Ser Val Gly Leu His Cys His 
290 295 300 

Leu Gin Pro Val Thr Glu Glu Gin Arg Asn Asn His Ala Pro Glu Leu 
305 310 315 320 

Thr He Ser Val Val Glu Pro Met Gly Gin He Trp Pro He Gly Ser 
325 330 335 

Pro Glu Tyr Ser Ser Asp Ser Ser Gin He Thr Ser Ser Asp Leu Ser 
340 345 ^ 350 

Asp Phe Gin Ser Pro Pro Pro Thr Gly Gly Thr Ala Ala Pro Phe Gly 
355 360 365 
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Ser Asp Val Ser Leu Pro Phe He Arg Leu Pro Gin Thr Val Leu Gin 
370 375 380 

Glu Ser Pro Leu Phe Phe Cys Phe Pro Gin Gly Thr Thr Ser Gin Gin 
385 390 395 400 

Val Leu Ser Ala Ser Tyr Ser Ser Gly Gly Ser Thr Leu His Pro Gin 
405 410 415 

Val He Gly Lys Leu Ser Gin Phe Phe 
420 425 



<210> 22 

<211> 418 

<212> PRT 

<213> Mus musculus 

<400> 22 

Met Ala His Pro Cys Gly Gly Thr Pro Thr Tyr Pro Glu Ser Gin He 
15 10 15 

Phe Phe Pro Thr He His Glu Arg Pro Val Ser Phe Ser Pro Pro Pro 
20 25 30 

Thr Cys Pro Pro Lys Val Ala He Ser Gin Arg Arg Lys Ser Thr Ser 
35 40 45 

Phe Leu Glu Ala Gin Thr Arg His Phe Gin Pro Leu Leu Arg Thr Val 
50 55 60 

Gly Gin Asn His Leu Pro Pro Gly Ser Ser Pro Thr Asn Trp Thr Pro 
65 70 75 80 

Glu Ala He Val Met Leu Gly Ala Thr Ala Asn Arg Val Asn Arg Glu 
85 90 95 

Leu Cys Glu Met Gin Val Gin Pro Val Phe Glu Pro Thr Gin He Tyr 
100 105 110 

Ser Asp Tyr Arg Pro Gly Leu Val Leu Ala Glu Glu Ala His Tyr Phe 
115 120 125 

He Pro Gin Glu Thr Val Tyr Leu Ala Gly Val His Tyr Gin Ala Gin 
130 135 140 

Val Ala Gly Gin Tyr Glu Gly He Ser Tyr Asn Ser Pro Val Leu Ser 
145 150 155 160 

Ser Pro Met Lys Gin He Ser Glu Gin Lys Pro Val Pro Gly Gly Pro 
165 170 175 

Ala Ser Ser Ser Val Phe Glu Phe Pro Ser Gly Gin Ala Phe Leu Val 
180 185 190 

Gly His Leu Gin Asn Leu Arg Leu Asp Ser Gly Pro Set Pro Ala Ser 
195 200 205 

Pro Leu Ser Ser He Ser Ala Pro Asn Ser Thr Asp Ala Thr His Leu 
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210 215 220 



Lys Phe His Pro Val Phe Val Pro His Ser Ala Pro Ala Val Leu Thr 
225 230 235 240 

Asn Ser Asn Glu Asn Arg Ser Asn Cys Val Phe Glu Phe His Ala Gin 
245 250 255 

Thr Pro Ser Ser Ser Gly Glu Gly Gly Gly lie Leu Pro Gin Arg Val 
260 265 270 

Tyr Arg Asn Arg Gin Val Ala Val Asp Ser Asn Gin Glu Glu Leu Ser 
275 280 285 

Pro Gin Ser Val Gly Leu His Cys His Leu Gin Pro Val Thr Glu Glu 
290 295 300 

Gin Arg Asn Asn His Ala Pro Glu Leu Thr lie Ser Val Val Glu Pro 
305 310 315 320 

Met Gly Gin lie Trp Pro lie Gly Ser Pro Glu Tyr Ser Ser Asp Ser 
325 330 335 

Ser Gin lie Thr Ser Ser Asp Leu Ser Asp Phe Gin Ser Pro Pro Pro 
■ 340 345 • 350 

Thr Gly Gly Thr Ala Ala Pro Phe Gly Ser Asp Val Ser Leu Pro Phe 
355 360 365 

lie Arg Leu Pro Gin Thr Val Leu Gin Glu Ser Pro Leu Phe Phe Cys 
370 375 380 

Phe Pro Gin Gly Thr Thr Ser Gin Gin Val Leu Ser Ala Ser Tyr Ser 
385 390 395 400 

Ser Gly Gly Ser Thr Leu His Pro Gin Val lie Gly Lys Leu Ser Gin 
405 410 415 

Phe Phe 



<210> 23 

<211> 426 

<212> PRT 

<213> Ratus ratus 

<400> 23 

Met Leu lie Gin Pro Gin Ser Val Ala His Pro Cys Gly Gly Thr Pro 
15 10 15 

Thr Tyr Pro Glu Ser Gin lie Phe Phe Pro Thr lie His Glu Arg Pro 
20 25 30 

Val Ser Phe Ser Pro Pro Pro Thr Cys Pro Pro Lys Val Ala lie Ser 
35 40 45 

Gin Arg Arg Lys Ser Thr Ser Phe Leu Glu Ala Gin Thjr Arg His Phe 
50 55 60 

Gin Pro Leu Leu Arg Thr Val Gly Gin Asn His Leu Pro Pro Gly Gly 
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65 75 ^^80 

Ser Pro Thr Asn Trp Thr Pro Glu Ala He Val Met Leu Gly Thr Thr 
85 90 95 

Ala Asn Arg Val Asn Arg Glu Leu Cys Glu Met Gin Val Gin Pro Val 
100 105 110 

Phe Glu Thr Thr Gin He Tyr Ser Asp Tyr Arg Pro Gly Leu Val Leu 
115 120 125 

Ala Glu Glu Ala His Tyr Phe He Pro Gin Glu' Thr Val Tyr Leu Ala 
130 135 140 

Gly Val His Tyr Gin Ala His Ala Ala Gly Gin Tyr Glu Gly He Ser 
145 150 155 160 

Tyr Asn Ser Pro Val Leu Ser Ser Pro Met Lys Gin He Thr Glu Gin 
165 170 175 

Lys Pro Val Pro Gly Cys Pro Ala Ser Ser Ser Val Phe Glu Phe Pro 
180 185 190 

Ser Gly Gin Ala Phe Leu Val Gly His Leu Gin Asn Leu Arg Leu Asp 
195 200 205 

Ser Gly Pro Ser Pro Ala Ser Pro Leu Ser Ser He Ser Ala Pro Asn 
210 215 220 

Ser Thr Asp Ala Thr His Leu Lys Phe His Pro Val Phe Val Pro His 
225 230 235 240 

Ser Ala Pro Ala Val Leu Thr His Ser Asn Glu Asn Arg Ser Asn Cys 
245 250 255 

Val Phe Glu Phe His Ala Gin Thr Pro Ser Ser Ser Ser Gly Glu Gly 
260 265 270 

Gly Gly He Leu Pro Gin Arg Val Tyr Arg Asn Arg Gin Val Ala Val 
275 280 285 

Asp Ser Ser Gin Glu Glu Leu Ser Pro Gin Ser Val Gly Leu His Cys 
290 295 300 

His Leu Gin Pro Val Thr Glu Glu Gin Arg Asn Asn His Thr Pro Glu 
305 310 315 320 

Leu Thr He Ser Val Val Glu Pro Met Gly Gin Asn Trp Pro Val Gly 
325 330 335 

Ser Pro Glu Tyr Ser Ser Asp Ser Ser Gin He Thr Ser Ser Asp He 
340 345 350 

Ser Asp Phe Gin Ser Pro Pro Pro Thr Gly Gly Thr Ala Ala Pro Phe 
355 360 365 

Gly Ser Asp Val Ser Leu Pro Tyr He Arg Leu Pro Gin Thr Val Leu 
370 375 380 

Gin Glu Ser Pro Leu Phe Phe Cys Phe Pro Gin Gly Thr Thr Ser Gin 
385 390 395 400 
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Gin Val 



lie Gly Lys Leu Ser Gin Phe Phe 
420 425 



<210> 24 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward amplification Primer 

<400> 24 

ttccagaagc attgttattt attt 24 



<210> 25 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse Replication Primer 



<210> 26 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward Replication Primer 

<400> 26 

caccagaggc cgtagttatg ttg 23 

<210> 27 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse replication primer. 



<400> 25 

cccccttgta ctggcttct 



19 



<400> 27 

ttgaggaggc agttcttctt gatt 



24 



<210> 28 
<211> 23 
<212> DNA 



<213> Artificial 
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<220> 

<223> Forward replication primer. 



<400> 28 

gcgcctgctg tgttaactca taa 



23 



<210> 29 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse replication primer, 

<400> 29 

ccaaagatgg ggaaactcta ctga 24 

<210> 30 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward replication primer. 



<210> 31 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse replication primer. 

<400> 31 

tgcaacaaat gtaccactct gg 22 

<210> 32 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward replication primer. 

<400> 32 

agctcctaaa tccccg i6 

<210> 33 

<211> 21 

<212> DNA 

<213> Artificial 



<400> 30 



accatcacct aaggagacag acc 



23 



<220> 
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<223> Reverse repJ^Pfcion primer, 

<400> 33 

gccatgtcta taaataccct g 21 
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